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Recombiiiational Cloning Using 
Engineered Recombination Sites 



PCTAJS96/10082 



Background of the Invention 
Field of the Invention 

The present invention relates to recombinant DNA technology. DNA and 
vectors having engineered recombination sites are provided for use in a 
recombinational cloning method that enables efficient and specific recombination 
of DNA segments using recombination proteins. The DNAs, vectors and 
methods are useful for a variety of DNA exchanges, such as subcloning of DNA, 
in vitro or in vivo. 

RelatedArt 

Sitespei^recombinases. Site spedfic lecombinases are oizymes that 
arc present in some viruses and bacteria and have been charactetizjed to have both 
endonuclease and ligase properties. These recombmases (along with associated 
proteins m some cases) recognize specific sequences of bases in DNA and 
exchange the DNA segments flaring those segments. The recombinases and 
associated proteins are collectively referred to as "recombination proteins" {see, 
e.g.„ Landy, A., Current Opinion in Biotechnology 5:699-707 (1993)). 

Numerous recombination systems jfrom various organisms have been 
described. See, e.g,, Hoess et al. Nucleic Acids Research 74(6):2287 (1986); 
Abremski et al, J. Biol Chem.26]il):39l (1986); Campbell, I 
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Bacteriol 77<23):7495 (1992); (^et at, J. Biol Chem 267(U):7794 (1992); 
Aiaki etoL.J. Mol Biol 225(1)05 (1992); Maeser and Kahnmaiin (1991) MoL 
Gen Genet 250:170-176). 

Many of these belong to the integrase family of recombinases 
(Argos etoL EMBOl 5:433-440 (1986)). Perhaps the best studied of these are 
the Integrase/afr system from bacteriophage k (Landy, A. Current Opinions in 
Genetics ondDevel. 3:699-707 (1993)), the Cre/loxP system from bacteriophage 
PI (Hoess and Afaremski (1990) laNucleicAdds and Molecular Biology, vol. 4. 
Eds.: Eckstein and Lill^, Bedin-Heidelbeig: Springer-Verlag; pp. 90-109) , and 
die FLP/FRT system from tiie Saccharomyced cerevi^ 2 ]i ciicle plasnud 
(Broach et a!. Cell 2^:227-234 (1982)). 

While these recondnnadon systems have been characterized for particular 
organisms, the related art has only tm^i using recombmant DNA flanked by 
recombination sites, for in vivo recombination. 

Backman (U.S. Patent No. 4,673,640) discloses the in vsvo use of X 
recombinase to reoombine a protdn producing DNA segment by enzymatic site- 
specific recombination using wild-type recombination sites attB and attP, 

Hasan and Szybalski (Gene 5<J:145-151 (1987)) discloses the use of A. Int 
recombinase in vivo for intramolecular recombination between wild type attP 
and attB sites which flank a promoter. Because the orientations of these sites are 
inverted relative to each other, this causes an irreversible flipping of the promoter 
region relative to the gene of interest 

PalazzDio et al Gene 55:25-36 (1990), discloses phage lambda vectors 
hamg bacteriophage X arms that contsun restriction sites positioned outside a 
cloned DNA sequence and between wild-type loxP sites. Infection of K colt cells 
that express the Ore recombinase witti these phs^e vectors results in 
recombination between the loxP sites and the in vivo excision of the plasmid 
replicon, including the cloned cDNA. 

Pds&i et al, (Nucl, Acids Res, 22:2392-2398 (1994)) discloses a metiiod 
for inseitmg into genomic DNA partial e^qxression vectors having a selectable 
maiker, flanked by two ^d-type FRT recognition sequences. FLP site-specific 
recombinase as present in the cells is used to integrate the vectors into tiie 



genome at predetennined sites. Under conditions vAiete the repUcon is 
functional, tiiis cloned genomic DMA can be amplified. 

Bebee et d, (U.S. Patoil No. 5,434,066) discloses tiie use of site-specific 
iecombinases such as Cre for DNA containing two loxP sites is used for in vivo 
recombination between the sites. 

Boyd (NucL Acids Res. 27:817-821 (1993)) discloses a method to 
fedlitate flie cloning of blunt-ended DNA usmg conditions that encourage 
inteimolecular ligation to a dq)hosphoiylated vector fliat contains a wild-type 
lo3cPsite acted i^on by a C^dte-specificiecombinase present in £ col/ host 
cells. 

Wateihousec/al (]^CTlio.93n9mmlNiwleicAcidsRes.21 (9):226S 
(1993)) disclose an in vivo meftod vAnsse light and heavy chains of a particular 
antibody were cloned in different phage vectors between hxP and loxP 511 sites 
and used to transfect new £ co/i cells. Cre, acting in the host cells on the two 
parental molecules (one plasmid, one phage), produced foxir products in 
equilibrium: two different cointegrates (produced by recombination at either /ojcP 
or loxP 511 sites), and two daughter molecules, one of which was the desired 
product. 

In contrast to the other related art, Schlake Sc Bode (Biochemistry 
55:12746-12751 (1994)) discloses an in vivo method to exchange expression 
cassettes at defined chromosomal locations, each flanked by a wild type and a 
spacer-mutated FRT recombination site. A double-reciprocal crossover was 
mediated in cultured mammalian cells by using this FLP/FRT system for site- 
specific recombination. 

Transposases. The family of enzymes, the tcanspOsases, has also been 
used to transfer genetic infomiation between rq>licons. Transposons are 
structurally variable, bemg desoibed as simple or co^^)ound, but typically 
encode the lecombinase goie flanked by DNA sequences organized in inverted 
orientations. Integration of transposons can be random or highly specific. 
Representatives such as Tn7, wluch are highly site-specific, have been applied to 
the in vivo movement of DNA segments between replicons (Lucklow et a/., 
J. Virol 67:4566-4579 (1993)). 



10 
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Devine and Boeke Nucl Acids Res. 22:3765-3772 (1994), discloses the 
construction of artificial transposons for the insertion of DNA segments, in vitrOy 
into recq>ient DNA molecules. Tlie syston makes use of the integrase of yeast 
TYl virus-like particles. The DNA s^mentofint»»st is doped, using standard 
inefhods, between the ends of the transi>oson-like element Intfaei»es^ce 
of the TYl iiitegrase, the resulting element integrates randomly into a second 
target DNA molecule. 

DNA dotting. The cloning of DNA segm^ currentiy occurs as a daily 
routine in mai^ leseaidi labs and as a pr^iequisite step in many genetic analyses. 
The purpose of tiiese clonings is various, however, two general purposes can be 
considered: (1) flic initial clomng of DNA fiwm large DNA or RNA segments 
(chromosomes, YACs, PGR fiagments, mRNA, etc.), done in a relative handful 
of known vectors such as pUC, pGem, pBlueScript, and (2) tiie subcloning of 
these DNA segments into specialized vectors for functional analysis. A great 
15 deal of time and effort is expended both in tiie initial cloning of DNA segm«its 

and in the transfer of DNA segments from tiie initial cloning vectors to the more 
specialized vectors. This transfer is called subcloning. 

ITie basic methods for clonmg have been known for many years and have 
dianged little during tiiat time. A typical cloning protocol is as follows: 

(1) digest the DNA of interest witii one or two restriction 
enzymes; 

(2) gel purify the DNA segment of mterest when known; 

(3) prepate die vector by cuttmg wifli appropriate restriction 
en^mes, tieatmg wifli alkalme phosphatase, gel purify etc., as 

25 i5)propriate; 

(4) ligate flie DNA segment to vector, wifli appropriate controls 
to estimate background of uncut and self-li^Ued vecton 

(5) mtroduce tiie resulting vector into an R coli host cell; 

(6) pick selected colonies and grow small cultures ovemig|it; 
30 (7) make DNA minipreps; and 

(8) analyze the isolated plasmid on agarose gels (often after 
diagnostic restriction enzyme digestions) or by PGR. 
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The specialized vectors used for subcloning DNA segments are 
functionally diverse. These include h\3t are not limited to: vectors for expressii^ 
genes in various organisms; for reguladng gene egression; for providing tags to 
aid in protein purification or to allow tracking of proteins in cells; for modifying 
4ie cloned DNA segment (e.g., generating deletions); for the synthesis of probes 
riboprobes); for the preparation of ten^lates for DNA sequencing; for the 
idmtification of protein coding re^ons; for the fusion of various protein-coding 
' regions; to provide large amounts of tiiB DNA of int»:est, etc. It is common fliat 
a particular investigation will involve subcloning tiie DNA segment of interest 
into several different specialized vectors. 

As known in the art, ^mple subclonings can be done in one day (e,g, , the 
DNA segment is not large and tiie restriction sites are conqiatible with tiiose of 
the subcloning vector). However, many other subclonings can take several 
wed[s, especially those involving unknown sequences, long fiagments, touc 
genes, uiismtable placraent of restriction sites, high backgrounds, impure 
enzymes,e/c. Subcloning DNA fiagments is thus often viewed as a chore to be 
done as few times as possible. 

Several methods for facilitating the cloning of DNA segments have been 
described, e.g., as in the following references. 

Ferguson, X, etal. Gene 16:191 (1981), discloses a family of vectors for 
subcloning fiagments of yeast DNA. The vectors encode kanamycin resistance. 
Clones of longer yeast DNA segments can be partially digested and ligated into 
the subcloning vectors. If the original cloning vector conveys resistance to 
ampiciliin, no purification is necessary prior to transformation, since the selection 
will be for kanamycin. 

Hashimoto-Ootoh, T., et aL Gene 4hV25 (1986), discloses a subclomng 
vector with uruque* cloning sites within a streptomycin sensitivity gene; in a 
streptomycin-resistant host, only plasmids mth inserts or deletions in the 
dominant sensitiWty gene will survive streptomycin selection. 

Accordingly, tracUtional subcloning methods, usmg restriction enzymes 
and ligase, are time consuming and relatively unreliable. Condderable labor is 
expended, and if two or more days later the desired subclone can not be foimd 
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among tiie candidate plasmids, the entire process must then be repeated with . 
alternative conditions attenQ>ted Although site specific recombinases have been 
used to recombine DN A m vrvo, the successful use of sudi enzymes in vitro ivas 
esqiected to sufiferfiom several pioblenis. Forexan^le,tfae^tespecifidtiesand 

5 effidoides wrae ^(pectiedto di£Eer III vitro; topologically-linked products were 

expected; and the topology of the DNA substiates and recombination proteins 
was eiq)ected to difG^ significantly in vitro {see, e.g., Adams et d, J. MoL 
Biol 225:661«73 (1992)). Reactions that could go on for many hours in vivo 
were expected to occur in significantly less time in vitro before the enzymes 

10 became inactive. Multiple DNA recombination products wieree>q)ected in the 

biolo£^cal host used, resulting in unsatis&ctoiy reliability, specificity or 
effidtticy of suibcloning. /nWAv recombination reactions wm not e^qiected to 
be sufficiently efficient to yield the desired levels of product 

Accordingly, there is a long felt need to provide an alternative subcloning 

15 system tiiat provides advantages over the known use of restriction enzymes and 

iigases. 

Summary of the Invention 

The present invention provides nucleic acid, vectors and methods for 
obtaining chimeric nucleic acid using recombination proteins and engineered 

20 recombination sites, in vitro or in vivo. These methods are highly specific, rapid, 

and less labor inten^ve than what is disclosed or suggested in tiie related 
background art The improved specificity, speed and yields of the pres^ 
invention &cilitates DNA or RNA subcloning, regulation or exchange useful for 
any related purpose. Such purposes include in vitro recombmation of DNA 

25 segments and in vitro or in vivo insertion or modification of transcribed, 

replicated, isolated or genomic DNA or RNA. 

The prtesent invention relates to nucleic acids, vectors and methods for 
moidng or exchanging segments of DNA using at least one engineered 
recombination site and at least one recombination protein to provide chimeric 

30 DNA molecules ^^ch have tiie desired charactaistic(s) and/or DNA segment(s). 
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Generally, one or more parent DNA molecules are recombined to give one or 
more dau^ter molecules, at least one of which is the desired Product DNA 
segment or vector. The invention thus relates to DNA, RNA, vectors and 
mediods to effect tbs exchange and/or to select for one or mare desired products. 
5 One embodiment of tiie presoit inv^on relates to a method of making 

chimeric DNA, vdiich comprises 

(a) cotcbudnginvitroorinvivo 

0) <kn Insert Donor DNA molecule, conqirising a dedred DNA 
segment flanked by a first recombination site and a second recombination site, 

10 wherein the first and second recombination sites do not recombine with each 

odier; 

01) a Vector Donor DNA molecule containing a third 
recombination site and a fourth recombination site, viierein the thud and fourth 
recombination sites do not recombine with each odier; and 

15 (iii) one or more site specific recombination proteins capable 

of recombining the first and third recombinational sites and/or die second and 

fourth recombinatioiial sites; 

thereby allowing recombination to occur, so as to produce at least one 

Cointegrate DNA molecule, at least one desired Product DNA molecule which 
20 comprises said desired DNA segment, and optionally a Byproduct DNA 

molecule; and then, optionally, 

(b) selecting for the Product or Byproduct DNA molecule. 
Another embodiment of the present invention relates to a kit comprising 

a carrier or recq^tacle bemg compartmentalized to receive and hold therein at 
25 least one container, wherein a first container contains a DNA molecule 

comprising a vector having at least two recombination sites flanking a cloning 
site or a Selectable marker, as described herein. The kit optionally fiirther 
comprises: 

Q a second container containing a Vector Donor plasmid 
30 comprising a subcloning vector and/or a Selectable maricer of which one 

or both are flanked by one or more engineered recombination sites; and/or 
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0i) a tUid container containing at least one recombination protein 
vAich recognizes and is capable of recombining at least one of said 
recombination sites. 

Other embodiments include DNA and vectors useful in the methods of tiie 
present invention. In particular, Vector Donor molecules are provided in one 
embodiment, wherein DNA segments wWiin Ihe Vector Donor are separated 
either by, CO in a drcular Vector Donor, at least two recombination sites, or (ii) 
in alinear Vector Donor, at ka^ one recomUnation ate, vAeie the recombination 
sites axe preferably en^neered to enhance specifidty or efiSciency of 
recombinatioa 

One Vector Donor embodiment comprises a first DNA segment and a 
second DNA segment, the first or second segment compising a Selectable 
marker. A second Vector Donor embodun«it conq>rises a first DNA s^ment 
and a second DNA segment, tiie first or second DNA segment compridng a toxic 
geAc. A tiurd Vector Donor embodiment comprises a first DNA segment and a 
second DNA segment, the first or second DNA segment comprising an inactive 
fiagmwit of at least one Selectable marker, vsiierein the inactive fiagment of tiie 
Selectable marker is capable of reconstituting a fimctional Selectable marker 
when recombined across the first or second recombination site with another 
inactive fiagment of at least one Selectable marker. 

The present recomlnnational cloning metiiod possesses several advantages 
over previous in vfvo mediods. Since single molecules of recombination products 
can be mtroduced mto a biological host, propagation of die desired Product DNA 
in the absence of other DNA molecules (e.g. , starting molecules, mtermediates, 
and by-products) is more readily realized. Reaction codditions can be fi:eely 
adjusted in vitro to optimize esayme activities. DNA molecules can be 
incon^)at^ble witiithe desired biological host (e.g., YACs, genomic DNA, etc.), 
can be used. Recombination proteins fix>m diverse sources can be mployed, 
together or sequentially. 

Other embodiments will be evident to those of ordinary skill in the art 
fiom tiie teachings contained herein in combination with what is known to the art 
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Bri^ Description of the Figures 

Figure 1 depicts one general method of the present invention, wherein the 
starting (parent) DNA molecules can be circular or linear. The goal is to 
exchange the new subcloning vector D for the original cloning vector B, It is 
dearable in one embodiment to select for AD and against all the other molecules, 
including the Cointegrate. The square and drcle are sites of recombination: e.g, , 
loxP sites, an sites, etc. For example, segmoat D can contain ejcpression signals, 
new drug markers, new origins of replication, or specialized functions for 
mapping or sequencing DNA. 

Figure 2A depicts an in vitro method of recombining an Insert Donor 
plasmid Oierc, pEZX:705) with a Vector Donor plasnud (here, pEZC726), and 
obtaining Product DNA and Byproduct daughter molecules. The two 
recombination ates are attP and loxP on the Vector Donor. On one segment 
defined by tibese ates is a kanamycui resistance gene whose promoter has been 
replaced by the tefOP operator/promoter fiomtransposonT/ilO; feeSizcmore 
et d., NucL Acids Res, i5(10):2875 (1990). In the absence of tet repressor 
protem, E coli RNA polymerase transcribes the kanamycin resistance gene from 
the tetOP, if tet repressor is present, it binds to tetOP and blocks transcription of 
tiie kanamycin resistance gene. The otiier segment of pEZC726 has tiie tet 
repressor gene expressed by a constitutive promoter. Thus cells transformed by 
pEZC726 are resistant to chloran^)henicol, because of tiie chloramphenicol acetyl 
transferase gene on tiie same segment as tetR, but are sensitive to kanamydn. 
The recombinase-mediated reactions resuh m separation of ttie tetR gene fiom tiie 
regulated kanamydn resistance gene. This separation results in kanamycin 
r^istance in cells receiving only tiie deared recombination products. The first 
recombination reaction is driven by tiie addition of tiie recombinase called 
Integrase. The second recombmation reaction is driven by adding flie 
recombinase Cie to tiie Cointegrate (here, pEZC7 Comtegrate), 
Figure 2B depicts a restriction map of pEZC705. 
Figure 2C depicts a restriction m^ of pEZC726. 
Figure 2D depicts a restriction map of pEZC7 Cointegrate. 
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Figure 2E depicts a restriction map of Intprod 
Figure 2F depicts a restriction map of Intbypro. 
Figure 3 A depicts an in vitro method of recombining ain Insert Donor 
plasmid (here, pEZC602) with a Vector Donor plasmid (here, pEZC629), and 
obtaining Product (hsxG, EZ;C6prod) and Byproduct (here, EZC6Bypr) daughter 
molecules. The two recombinaticm sites are /flocP and /orP 577. One segment of 
pEZC629 defined by these sites is akanamycm re^stance gene whose promoter 
hasbeeniq>laoedby&eterOPoperatoc^mot^fomtransposon7iilO. In the 
absence of tet xqiressor protein, K coli RNA polymerase transcribes the 
kanamydn resistance gene fiomfliel^O?. Iftet repressor is present, it binds to 
tetOP and blodcs transcription of the kanamycin resistance gene. The other 
segment of pEZC629 has die tet r^nsssor goie expressed by a constitutive 
promoter. Thus cells transformed by pE2X)629 are resistant to chlorampheiucol, 
because of the chloramphenicol acetyl transferase gene on the same segment as 
tetRy but are sensitive to kanamycin. The reactions result in sqparation of the tetR 
gene from the regulated kanamycin resistance gene. This separation results in 
kanamycin resistance in cells receiving die desired recombination product The 
first and the second recombination events are driven by the addition of the same 
recombinase, Cre. 

Figure 3B depicts a restriction m^ of EZC6Bypr. 
Figure 3C depicts a restriction map of £ZC6prod. 
Figure 3D depicts a restriction map of pEZC602. 
Figure 3£ depicts a restriction map of p£ZC629. 
Figure 3F depicts a restriction map of EZC6comt 
Fl^re 4A dq)icts an application of the in vitro method of 
recomblnational cloning to subclone the chlorampheiucol acetyl transferase gene 
into a vector for expression in eukaryotic cells. The Insert Donor plasmid, 
pEZC843, is comprised of the chloramphenicol acetyl transferase gene of E. coli, 
cloned between loxP and attB sites such that die loxP site is positioned at the 
5'-end of the gene. The-Vector Donor plasmid, pEZClOOS, contains the 
cytomegalovirus eukaryotic promoter apposed to a loxP site. The supercoiled 
plasmids were combined with lambda Integrase and Cre recombinase in vitro. 
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After incubation, competent E. coli cells were transformed vdth the 
recombinational reaction solution. Aliquots of transformations were spread on 
agar plates containing kanamycin to select for the Product molecule Oiere 
CMVProd). 

5 Figure 4B depicts a restriction map of pEZC843. 

Figure 4C depicts a restriction map of pE2X^1003. 
Figure 4D depicts a restriction map of CMVBypro. 
Figure 4E dq}icts a restriction map of CMVProd. 
F^re 4F depicts a restriction msqp of CMVcoint 
10 Figure 5A depicts a vector diagram of pEZClSOl. 

Figure 5B dq)icts a vector diagram of pEZC1305. 
Figure 5C depicts a vector diagram of pEZC1309. 
Figure 5D depicts a vector dis^iam of p£ZC13 13. 
Figure 5E depicts a vector diagram of pEZC1317. 
15 Figure 5F depicts a vector diagram of pEZC1321 . 

Figure 5G depicts a vector diagram of pEZC1405. 
Figure 5H depicts a vector diagram of pEZC1502. 
Figure 6A depicts a vector diagram of pEZC1603. 
Figure 6B depicts a vector diagram of pEZC1706, 
20 Figure 7A depicts a vector diagram of pEZC2901 . 

Figure 7B depicts a vector diagram of pEZC2913 
Figure 7C depicts a vector diagram of pEZC3101. 
Figure 7D depicts a vector diagram of pEZC1802. 
Figure 8A depicts a vector diagram of pGEX-2TK. 
25 Figure 8B depicts a vector diagram of pE2!X:3501; 

Figure 8C depicts a vector diagram of pEZC3601 . 
Figure 8D depicts a vector dii^ram of pEZC3609. 
Figure 8E dq)icts a vector diagram of pEZC3617. 
Figure 8F depicts a vector diagram of pEZC3606, 
30 Figure 8G depicts a vector diagram of pEZC3613. 

Figure 8H depicts a vector diagram of pEZC3621 . 
Figure 81 depicts a vector diagram of GST-CAT. 
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Figure 8J depicts a vector diagram of GST-pho A. 
Figure 8K depicts a vector diagram of p£Z£^320i . 

Detailed Description of the Preferred Embodiments 

It is unexpectedly discovered in the present invention that subcloning 
reactions can be provided using recombinational cloning. Recombination cloning 
according to the present invention uses DN As, vectors and methods, in vitro and 
in vivo, for moving or exchanging segments of DNA molecules using engineoed 
recombination ^tes and recombination protdns. Hiese methods provide diimeric 
DNA molecules that have the dedied characteristic(s) and/or DNA segment(s). 

Hie present inv^on thus provides ruicleic add, vectors and methods for 
obtaining chimeric nucldc add using recombination proteins and engineered 
recombination dtes,mv</ro or ill vn^o. These mettiods are highty specific, rapid, 
and less labor intensive than what is (Closed or suggested in the related 
background art The improved spedficity, speed and yields of the present 
invention fedlitates DNA or RNA subcloning, regulation or exchange useful for 
any related purpose. Such purposes include in vitro recombination of DNA 
segments and in vitro or in vivo insertion or modification of transcribed, 
replicated, isolated or genomic DNA or RNA. 

Definitions 



In the description that follows, a number of terms used in recombinant 
DNA technology are utilized extensively. In order to provide a clear and 
consistent understanding of the specification and daims, including the scope to 
be ^ven such terms, the following definitions are provided. 

Byproduct is a daughter molecule (a new done produced after the 
second lecombmation event during the recombinational clonmg process) lacking 
the DNA which is desired to be subcloned. 
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Cointcgrate: hat least one recombination intermediate DNA molecule 
of the present invention that contains both parental (starting) DNA molecules. 
It wU usually be circular. In some embodiments it can be linear. 

Host: is any prokaryotic or eukaiyotic organism that can be a recipient 
of the recombinational clomng Product A "host," as the term is used herein, 
includes prokaryotic or eukaryotic orgamsms that can be genetically engineered. 
For examples of such hosts, see Maniatis et al, Molectdar Chning: A Laboratory 
Manual, Cold Spring Harbor Laboratory, Cold Spring Harbor, New York (1982). 

Insert: is the desired DNA segment (segmenti4 of Figure 1) vAich one 
wshes to manipulate by file meflibd of fee present inventiorL Tbe insert can have 

one or more genes. 

Insert Donon is one of the two parental DNA molecules of the present 
invention which carries the Insert The Insert Donor DNA molecule comprises 
fee Insert flanked on bofe ades vdfe recombination signals. The Insert Donor 
can be linear or circular. In one embo(toent of fee invention, fee Insert Donor 
is a circular DNA molecule and further comprises a cloning vector sequence 
outside of fee recombination signals (see Figure 1). 

Product is one or bofe fee desired daughter molecules comprising fee A 
and D or B and C sequences which are produced after fee second recombination 
event during fee recombinational cloning process (see Figure 1). The Product 
contains fee DNA which was to be cloned or subcloned. 

Promoter: is a DNA sequence generally described as the 5 -re^on of a 
gene, located proximal to fee start codon. The transcription of an adjacent DNA 
segment is initiated at fee promoter region. A represable promoter's rate of 
transcription decreases in response to a repressing agent Ah indudbte promoter's 
rate of transcription increases in response to an inducing agent A constitutive 
promoter's rate of transcription is not specifically regulated, feough it can vary 
under fee influence of general metabolic conditions. 
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Recognition sequence: Recognition sequences are particular DNA 
sequences which a protein, DNA, or RNA molecule (e,g., restriction 
endonuclease, a modification methylase, or a recombinase) recognizes and binds. 
For example, the recognition sequence for Cre recombinase is /oxP vAdch is a 34 
base pair sequence comprised of two 13 base pair inverted rq)eats (sendng as the 
recombinase binding sites) flanking an 8 base pair core sequCTce. &eFiguiel 
of Sauer, Current Opinion in Biotechnology 5:521-527 (1994), Other 
exan:^)les of recognition sequences are the MB, attP, OttLy and ottR sequences 
which are recognized by die recombinase enzyme X Integrase. attB is an 
approximately 25 base pair sequence containing two 9 base pair core-type Int 
landing shes and a 7 base pair overlap r^on. oT/P is an proximately 240 base 
psai sequence containing core^type bit binding sites and arm-type Int binding 
sites as well as sites for auxiliazy proteins IHF, FIS, and Xis, See Landy, Current 
^Opinion in Biotechnology 5:699-707 (1993). Such sites are also engineered 
15 according to the present invention to enhance methods and products. 

Recombinase: is an enzyme vdiich catalyzes the exchange of DNA 
segments at specific recombination sites. 

Recombinational Cloning: is a method described herein, whereby 
segments of DNA molecules are exchanged, inserted, replaced, substituted or 
20 modified, in V///-0 or i/7 vfvo. 

Recombination proteins: include excisive or integrative proteins, 
enzymes, co-j&ctors or associated proteins tiiat are involved in recombination 
reactions involving one or more recombination sites. See, Landy (1994), infra. 
Repression cassette: is a DNA segment that contains a repressor of a 
25 Selectable marker present in the subcloning vector. 

Selectable marker: is a DNA segment that allows one to select for or 
agmnst a molecule or a cell that contains it, often under particular conditions. 
These markers can encode an activity, such as, but not limited to, production of 
RNA, peptide, or protein, or can provide a binding site for RNA, peptides, 
30 proteins, inorganic and organic compounds or compositions and the like. 

Examples of Selectable markers include but are not limited to: (1) DNA 
segments that encode products which provide resistance against otherwise toxic 
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compounds (eg., antibiotics); (2) DNA segments that encode products whidi are 
otiiCTwise lacking in the recipient ceU (e.g., tRNA genes, auxotrophic markers); 
(3) DNA segments tiiat encode products vMch suppress the activity of a gene 
product; (4) DNA segments that encode products which can be readily identified 
(e.g., phenotypic markers such as p-galactosidase, green fluorescent protem 
(GFP), and ceU surfece proteins); (5) DNA segments tiiatWnd products which are 
otherwise detrimental to cell survival and/or function; (6) DNA stents that 
oth^wise irihibit flxe activity of any of the DNA segments described in Nos. 1-5 
above (€.g., antisense oUgomicleotides); (7) DNA segmoits tiiat bind products 
tiiat modify a substrate (e.g. restriction endonucleases); (8) DNA segments tiiat 
can be used to isolate a deared molecule (eg. ^ific protem binding sites); (9) 
DNA s^ments fliat racode a specific nucleotide sequence wfaidi can be 
otiierwise non-fimctional {e.g., for PGR an^jlification of subpopulations of 
molecules); and/or (10) DNA segments, which when abscait, directiy or indirectiy 
confer scnsitiidty to particular compoimds. 

Selection scheme: is any method which allows selection, enrichment, or 
identification of a desired Product or Product(s) from a mixture containing tiie 
Insert Donor, Vector Donor, and/or any intermediates, (e.g. a Cointegrate) 
Byproducts. The selection schemes of one preferred embodiment have at least 
two components tiiat are eitiier linked or unlinked during recombinational 
cloning. One component is a Selectable maricer. The otiier component controls 
tiie caq)ression in vitro or in vivo of tiie Selectable maricer, or survival of tiic cell 
harboring tiie plasnud carrj^ig tiic Seledsible maricer. Generally, tiiis controlling 
element will be a repressor or inducer of tfie Selectable maricer, but otfier means 
for controlling expression of tiie Selectable marker canlje used. Whether a 
repressor or activator is used will depend on wheflicr flie maricer is for a positive 
or negative selection, and tiie exact arrangement of tiie various DNA segments, 
as will be readily apparent to tiiose sldlled in tiie art A preferred requirement is 
fliat tiie selection scheme results in selection of or enrichment for only one or 
more desired Products. As defined herein, to select for a DNA molecule includes 
(a) selecting or enriching for tiie presence of tiie desired DNA molecule, and (b) 
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selecting or enriching against the presence of DNA molecules that are not the 
desired DNA molecule. 

In one embodiment, the selection schemes (which can be carried out 
reversed) will take one of three forms, which wll be discussed in terms of 

5 Figure 1 . The first, exemplified herein with a Selectable marker and a repressor 

therefor, selects for molecules having segment D and lacking segment C. The 
second selects agmnst molecules having segment C and for molecules having 
segment D. Possible embodiments of the second form would have a DNA 
segment canymg a gene toxic to cells into which die in vitro reaction products 

10 are to be introduced A toxic gene can be a DNA that is e3q)ressed as a toxic gene 

- product (a toxic protein or RNA), or can be toxic m and of itself. (In the latter 
case, the toxic gene is und^stood to cany its classical definition of "lieritable 
trait".) 

^camples of such toxic gene products are well known in the art, and 
15 include, but arc not limited to, restriction endonucleases (e.g., DpnJ) and genes 

that kill hosts in the absence of a suppressing fimction, e.g., kicB. A tome gene 
can alternatively be selectable in vitro, e.g. , a restriction site. 

In the second form, segment D carries a Selectable marker. The toxic 
gene would eliminate transformants harboring the Vector Donor, Cointegrate, and 
20 Byproduct molecules, while the Selectable marker can be used to select for cells 

containing the Product and against cells harboring only the Insert Donor. 

The thud form selects for cells that have both segments AmdD'mcison 
die same molecule, but not for cells tiiat have both s^m^ in trans on different 
molecules. This could be embodied by a Selectable marker that is split into two 
25 inactive fi:agments, one each on segments^ and D. 

The fi:agment5 are so arranged relative to the recombination sites that 
vfbsa the segments are brought together by the recombination event, they 
reconstitute a fimctional Selectable mark^. For example, the recombinational 
event can link a promoter vitii a structural gene, can link two fragments of a 
30 structural gene, or can link genes that encode a heterodimeric gene product 

. needed for survival, or can link portions of a replicon. 
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Site-specificirecombinase: isatypeofrecwmbinase which typically has 
at least the following four activities: (1) recognition of one or two specific DNA 
sequences; (2) cleavage of said DNA sequence or sequences; (3) DNA 
topoisomeiase activity involved in strand exchange; and (4) DNA ligase activity 
to rcseal the cleaved strands of DNA. See Sauer, B., Current Opinions in 
Biotechnology 5:521-527 (1994). Conservative ate-specific recombination is 
distinguished fiom homologous recombination and transposition by a hi£^ degree 
of specificity for botii partners. The strand exchange mechanism mvolves the 
cleavage and rejoining of spedfic DNA sequences in the absence of DNA 
syiifliesis (Landy, A. (1989) Ann, Rev, Biochem. 5«:913.949). 

Subcloning vector: is a doiung vector comprising a circular or linear 
DNA molecule whidi includes an appropriate leplicon. In the present invention, 
the subcloning vector (segment D m Figure 1) can also contain functional and/or 
regulatory elements that are desued to be incorporated into the final product to 
act \xpon or with the cloned DNA Insert (segmental in Figure 1). The subcloning 
vector can also contain a Selectable marker (contained in segment C in Figure 1). 

Vector is a DNA that provides a useful biological or biochemical 
property to an Insert. Examples include plasmids, phages, and other DNA 
sequences \^iiich are able to replicate or be replicated in vitro or in a host cell, or 
to convey a desired DNA segment to a desired location witiiin a host cell A 
Vector can have one or more restriction endonuclease recognition sites at which 
the DNA sequences can be cut in a determinable fiishion witiiout loss of an 
essential biological Amotion of the vector, and into ^ch a DNA fiagment can 
be spliced in order to bring about its repUcation and cloning. Vectors can further 
im)vide primer sites, e.g., for PGR, transcriptional and/or ttanslational initiation 
and/or regulation sites, recombinational ^gnals, replicons, Selectable markers, 
etc, Cleariy.metiiodsofinserting a desired DNA fiagmentwMch do not requke 
the use of homologous recombination or restriction en^rmes (such as, but not 
Hmited to, UDG clonmg of PGR fiagments (U.S. Patent No. 5,334,575, entirely 
moorporated herdn by reference), T: A cloning, and the like) can also be applied 
to clone a fi:agment of DNA mto a cloning vector to be used accordmg to tiie 
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present inventioiL The cloning vector can further contain a Selectable marker 
suitable for use in the identification of cells transformed widi the cloning vector. 

Vector Donor: is one of the two parental DNA molecules of the present 
invention vMch carries the DNA segments encoding the DNA vector vMdti is to 

S become part ofthe desired Product The Vector Donor con^srises a subclomng 

vector D (or it can be called the cloning vector if the Insert Donor does not 
akead^ contain a cloning vector) and a segment C fkmked fay recombination sites 
(see Figure 1)« Segm^ts C and/or D can contain elemmts fliat contribute to 
selection fc^ the desued Product dau^ter molecule, as d^ctibed above for 

10 selection schmes. The recombmation signals can be the same or different, and 

can be acted «qx>n by the same or different xecombinases. In addition, the Vector 
Donor can be linear or circular. 

Description 

One general scheme for an in vitro or in vivo method of the invention is 
shown in Figure 1, where tiie Insert Donor and the Vector Donor can be either 
circular or linear DNA, but is shown as circular. Vector D is exchanged for the 
original cloning vector A, It is desirable to select for the daughter vector 
containing elements A and D and against other molecules, includii^ one or more 
Cointegrate(s). The square and circle are different sets of recombination sites 
(e.g., lox sites or att sites). Segment ^4 or D can contain at least one Selection 
Marker, expression signals, origins of replication, or specialized functions for 
detecting, selecting, expressing, mapping or sequencing DNA, where D is used 
in this example. 

Examples of desired DNA segments that can be part of El^enti4 or D 
include, but are not limited to, PCR products, large DNA segmoits, genomic 
clones or fiagments, cDNA clones, functional elements, e/c., and genes or partial 
genes, vdiich encode useful nucleic acids or proteins. Moreover, tise 
recombmational cloning of the present invention can be used to make ex vivo and 
in vivo gene transfer vehicles for protdn e3q)ressi6n and/or gene then^y . 
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In Figure 1, the scheme provides the desired Product as containing 
vectors 2> and as follows. The Insert Donor (containing A and E) is first 
recombined at the square recombination sites by recombination proteins, with the 
Vector Donor (containing C and 2>), to form a Co-integrate Imving each of A-D- 
C-B. Next, recombination occurs at the circle recombination sites to form 
Product DNA {A and D) and Byproduct DNA (C and B). However, if desired, 
two or nu>re different Co-integrates can be formed to generate two or more 
Products. 

In one embodiment of die present in vitro or in vivo lecombinational 
cloning method, a method for selecting at least one desired Product DNA is 
provided. This can be understood by consideration of the map of plasmid 
pEZC726 depicted in Figure 2. Ihe two exemplary recombination sites are ol/P 
and/or?. On one segment defined by tiiese sites is a kanamycin resistance gene 
whose promoter has been replaced by the tetOP operator/promoter fiom 
transposon TulO. In the absence of tet repressor protein, K coii RNA polymraase 
transcribes the kanamycin resistance gene firom the tetOP. If tet repressor is 
present, it binds to tetOP and blocks transcription of the kanamycin resistance 
gene. The other segment of pEZC726 has the tet repressor gene expressed by a 
constitutive promoter. Thus ceils transformed by pEZC726 are resistant to 
chloramphenicol, because of the chloramphenicol acetyl transferase gene on the 
same segment as tetR, but are sensitive to kanamycin. The recombination 
reactions result in separation of the tetR gene from the regulated kanamycin 
resistance gene. This separationresults in kanamycm resistance in cells receiving 
the desired recombination Product 

Two different sets of plasmids were constructed to demonstrate the 
mW/rometiiod. One set, for use with Crerecombinase only (clonmg Vector 602 
and subcloning vector 629 (Figure 3)) contained loxP and loxP 511 sites. A 
second set, for use with Cre and integcase (cloning vector 70S and subcloning 
vector 726 (Figure 2)) contaiiKd loxP and att sdtes. The efticiency of production 
of the desired daughter plasmid was about 60 fold higher using both mizymes 
than using Cre alone. Nineteen of twenty four colonies fix>m the Cre-only 
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reaction contained the desired product, while thirty eight of thirty eight colonies 
6om die integrase plus Cre reaction contained the desired product plasmid 

Other Selection Schemes A variety of selection sdiemes can be used that 
are known in die art as they can suit a particular puipose for v^ch the 
recombinational cloning is carried out £)ependingiq)on individual preferences 
and needs, a numb^ of different types of selection schemes can be used in the 
lecombmational cloning method ofthepresoit invention. The skilled artisan can 
take advantage of the availability of the many DNA segments or methods for 
making tiiem and tiie dififerent metiiods of selection that are routinely used in the 
art Such DNA segments include but are not limited to those wfaidii^codes an 
acti^ty sudi as, but not limited to, production of RNA, peptide, or protein, or 
providing a bmding she for such RNA,p€^ytide, or protein. Examples of DNA 
molecules used in devising a selection sdieme are given above, under the 
defimtion of "selection scheme" 

Additional examples include but aie not limited to: 

0) Generation of new primer sites for PGR {e,g„ juxtaposition of two 
DNA sequences that were not previously juxtaposed); 

(ii) Inclusion of a DNA sequence acted upon by a restriction 
endonuclease or other DNA modifying enzyme, chemical, 
ribozyme, etc.; 

(iii) Inclusion of a DNA sequence recognized by a DNA binding 
protein, RNA, DNA, chemical, etc.) (e.g., for use as an affinity 
tag for selecting for or excluding from a population) (Davis, NucL 
Acids Res. 24:702-706 (1996); J. Virol 69: 8027-8034 (1995)); 

0v) /«v/fro selection ofRNAligands for flieribosomalL22 protein 
associated wifli Epstem-Bair virus-expressed RNA by using 
randomized and cDNA-derived RNA libraries; 

(vi) The positioning of functional elements whose acti^ty requires a 
specific orientation or juxtapodtion (e.g., (a) a recombination site 
vrtiich reacts poorly in trans, but when placed in cis, in the 
presence of the impropriate proteins, results in recombination tiiat 
destroys certain populations of molecules; (e.g., reconstitution of 
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a promoter sequence that allows in vitro RNA synthesis). The 
FN A can be used directly, or can be reverse transcribed to obtain 
the desired DNA constnict; 
(viO Selection of the desired product by size (e.g., fractionation) or 

other physical property of die mplecule(s); and 
(viii) Includon of a DNA sequence reqiured for a specific modification 

{e.g., mediylation) that allows its identification. 
Afier formation of the Product and Byproduct in tiie mediod of the present 
invention, tiie sdection stq> can be carried out dih^ in vitro or in vivo depending 
t^n tiie particular selection sdieme vMdk has been optionally devised in the 
particular recombinational cloning procedure. 

For example, an in vitro m^faod of selection can be devised for the Insert 
DonorandVectorDonorDNAmolecules. Such sdieme can involve engineering 
a rare restriction site in the starting circular vectors in such a way that after the 
recoinbination events tiie rare cutting sites end up in the Byproduct Hence, when 
tiie restriction enzyme which binds and cuts at the rare restriction site is added to 
tiie reaction mixture in vitro, all of the DNA molecules carrying the rare cutting 
site, /.e., the starting DNA molecules, the Comtegrate, and the Byproduct, will be 
cut and rendered nonreplicable in the intended host cell. For example, cutting 
sites in segments B and C (see Figure 1) can be used to select agamst ail 
molecules except the Product Alternatively, only a cutting site in C is needed if 
one is able to select for s^ment Z), e.g. , by a dmg resistance gene not found on B, 
Similarly, an in vitro selection metiuxi can be devised when dealing witii 
linear DNA molecules. DNA sequences complementaiy to a PGR primer 
sequence can be so engineered tiiat they are transferred, through the 
recombmational cloning metiiod, only to the Product molecule.-. After the 
reactions are completed, the ^inopriate primers are added to die reaction solution 
and the sample is subjected to PGR. Hence, all or part of die Product molecule 
is amplified. 

Odier in vivo selection schemes can be used with a variety of £. coU cell 
lines. One is to put a repressor gene on one segment of the subcloning plasmid, 
and a drug marker controlled by that repressor on the other segment of the same 
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plasmid Ano&er is to put a killer gene on segment C of the subcloning plasmid 
(Figure 1). Of course a way must exist for growing such a plasmid, /.e., there 
must exist circumstances under which die killer gene will not kill. Therearea 
number of these genes known whidi require particular strains of E. colL One 

S such sdieme is to use die restiicdon eozyme Dpnl, which will not cleave unless 

its recognition sequence GATC is mediylated. Many popular common K coli 
strains medi}4ate GATC sequences^ but diere are mutants in ^di cloned Dpnl 
cahbe esqnessed widiout hann. 

Of course analogous selection schmes can be devised fi>r btfa^ host 

10 organisms. For example, die terrqiressor/operator of 7>t70 has been adapted to 

control g^ expression in eukaryotes (Qossen, M., and Bujard, R, Proc. Natl. 
Acad ScL USA ^P:5547-5551 (1992)). Thus die same control of drug resistance 
by the tet repressor exemplified herdn can be implied to select for Product in 
eukaiyotic cells. 



IS RecombinaUon Proteins 



In the present invention, the exchange of DNA segments is achieved by 
the use of recombination proteins, including recombinases and associated 
co-fectors and proteins. Various recombination proteins are described in the art 
Examples of such recombinases include: 

20 Oe: A protdn from bacteriophage PI (Abrmiski and Hoess, J. Biol 

Chem 259(3):1509-1514 (1984)) catalyzes the exdiai^e G-e., causes 
reoombmadon) between 34 bp DNA sequences called loxP (locus of crossover) 
sites (See Hoess et aU Nvcl Acids Res, i^(S)'2287 (1986)). Cre is available 
commerdally (Novs^en, Catalog No. 69247-1). Recombinadon mediated by Cre 

25 is freely reverable. From diennodynandc considerations it is not surprising that 

Qe-mediated integration (recombination between two molecules to form one 
molecule) is much less efficient than Cre-mediated excision (recombination 
between two loxP sites in the same molecule to form two daughter molecules). 
Cre works in simple buffers with either magnesium or spermidine as a cofactor, 

30 as is well known in the art. The DNA substrates can be either linear or 
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supercoiied. A number of mutant loxP sites have been described (Hoess et aL, 
supra). One of these, laxP 511 ^ recombines with another loxP 511 site, but will 
uoX recombine with a loxP site. 

Integrase: A protein from bacteriophage lambda that mediates the 
integration of the lambda genome into flie .£ coli chromosome. The 
bacteriophage A, Int recomlnnatiQDal proteins promote hreverrible recomHnation 
between its substrate att sites as part of the formation or induction of a lysogenic 
state. Reversibility of tiie recombioation reactions results from two independent 
pathways for mtegrative and excisive recombination. Each pathway uses a 
liniqi^ but ovedapping, sxX of the 15 protein bmding sites that comprise att site 
DNAs. Cooperative and competitive interactions involving four proteins (Int, 
?Gs, IHF imd FIS) determine ti^ direction of recombination. 

Integrative reoombmation involves the Int and IHF proteins and sites attP 
(240 bp) and ottB (25 bp). Recombination results in the formation of two new 
sites: attL and attR, Excisive recombination reqmres Int, IHF, and Xis, and sites 
attL and attR to generate attP and eOtB, Under certain conditions, PIS stimulates 
excisive recombination. In addition to these normal reactions, it should be 
appreciated that attP and attB, when placed on the same molecule, can promote 
excisive recombination to generate two excision products, one with attL and one 
witii attR, Similarly, intermolecular recombination between molecules 
containing attL and attR, in the presence of Int, IHF and Xis, can result ui 
integrative recombmation and tiie generation attP and attB, Hence, by flanking 
DNA segments witii appropriate combinations of engineered att sites, in the 
presence of the appropriate recombination proteins, one can direct excisive or 
integrative recombination, as reverse reactions of each other. 

Each of the att sites contains a 15 bp core sequence; mdividual sequence 
dem^its of fiinctional significance lie within, out^de, and across the boundaries 
of this common core (Landy, A., Amu Rev. Biochem. 55:913 (1989)). Efficient 
recombination between the various att sites requires that the sequence of the 
central common region be idoitical between the recombining partners, however, 
tiie exact sequence is now found to be modifiable. Consequentiy, derivatives of 
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flie att site vwth changes within the core are now discovered to recombine as least 
as efficiently as the native core sequences. 

Integrase acts to recombine the attP site on bacteriophage lambda (about 
240 bp) with the attB site on the E, coli genome (about 25 bp) (W cisberg, RA- 
5 and Landy, A- in Lambda iJ, p. 21 1 (1983), Cold Spring Harbor Laboratory)), to 

produce the integrated lambda genome flanked by ottL (about 100 bp) and attR 
(about 160 bp) sites. In tibe absence ofSs (see below), Hus reaction is essentially 
irreversible. The integration reaction mediated by integrase and IHF woiks 
in vUrOy vnth sinq)le buffer containing spenni<yiie. Integrase can be obtained as 
10 desraibed by Nash, ILA., Methods of&izymohgy i0ft210-216 (1983). IHF can 

be obtained as described by Fihitowicz, M., etoL, Gene 7^7:149.150 (1994). 

In the presence of the X protdn Xis (exdse) integrase ciatalyzes ike 
reaction of attR aiid ottL to fimn attP and attB, i.e., it promotes the reverse of the 
reaction desoibed above. Ihis leai^on can also be applied in the present 
15 inventioa 

Other Recombination Systems. Numerous recombination systems fiom 
various organisnis can also be used, based on the teaching and guidance provided 
herein. See, e.g,, Hoess et al. Nucleic Acids Research 7<6):2287 (1986); 
Abremski et al, J. Biol. Chem.261il):39l (1986); Campbell, J, Bacterial 
20 /7-^(23):7495 (1992); Qian et aU Biol Chenu 2(^7(11):7794 (1992); Araki 

et al, J. Mol Biol 225(1):25 (1992)). Many of these belong to the integrase 
femily of recombmases (Aigos et a/. EMBO 1 5:433^440 (1986)). Perfiaps the 
best studied of these are the Integrase/ar/ system from bacteriophage X (Landy, 
A. (1993) Current Opimons in Genetics and Devel 3:699-707), the CxdloxP 
25 system from bacteriophage PI (Hoessand Abremski (1990jlnMicfe/c^ci<3frflffK7 

Molecular Biology^ voL 4. Eds,: Eckstein and Lilley, Berlin4Ieidelberg: 
Springer-Veriag; pp. 90-109) , and the FLP/FRT system from the Saccharomyces 
coevisiae 2 |t circle plasmid (Broach et al Cell 29:227-234 (1982)). 

Members of a second family of site-specific recombinases, the resolvase 
30 family (e,g.,y5,Tn3 resolvase, Hin, Gin, and Cin) are also known. Members of 

this highly related family of recombinases are typically constrained to 
intramolecular reactions (eg., inversions and excisions) and can require host- 
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encoded fectois. Mutants have been isolated that relieve some of die 
requixenients for host factors (Maeser and Kahnmann (1991) MoL Gen. Genet 
230:170-1 76), as well as some of &e constraints of intramolecular recombination. 

Oth^ ^te-specific recombinases similar to X Int and similar to PI Qe can 
be substituted for Int and Ore. Such recombinases are known. In many cases the 
purification of such other recombinases has been described in the ait. In cases 
when they are not known, cell extracts can be used or the ensues can be 
partially purified using procedures described for Cre and Int 

While Cre and Int are described in detail for reasons of example, many 
related recombinase systems exist and their application to the described invention 
is also {ffovided according to the present invention. Theintegrasefiuhily ofdte-t 
spedfic recombinases can be used to provide alternative recombination protdns 
and recombination sites for the present invention, as site-specific recombination 
protdnsencodedbybacteriophagelambda,phi80,P22,P2,186,P4andPl. This 
group of proteins exhibits an unejqiectedly large diversity of sequences. Despite 
tills diversity, all of the recombinases can be aligned m thdr C-terminal halves. 

A 40-Tesidue region near tiie C tenninus is particularly well conserved in 
all the proteins and is homologous to a region near the C terminus of the yeast 2 
mu plasmid Flp protem. Three positions are perfectly conserved within this 
family: histidine, arginine and tyrosine are found at respective alignment 
positions 396, 399 and 433 within the well-conserved C-terminal region. These 
residues contribute to the active site of this family of recombinases, and suggest 
that tyrosine-433 forms a transient covalent linkage to DNA during strand 
cleavage and rejoining. See, e.g., Argos, P. etal, EMBOJ. 5:433-40 (1986). 

Alternatively, IS231 and other Bacillus thuringiensis transposable 
elements could be used as recombination proteins and recombination sites. 
Bacillus thuringjen^ is an entomopathogenic bacterium whose toxicity is due 
to the presence in the sporangia of delta-endotoxin crystals active against 
agricultural pests and vectors of human and animal diseases. Most of the genes 
coding for tiiese toxin proteins are plasmid-bome and are generally structurally 
associated witii insertion sequ^ces (IS231, IS232, IS240, ISBTl and ISBT2) and 
transposons (Tn4430 and Tn5401), Several of these mobile elements have been 
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shown to be active and participate in the aystal gene mobility, thereby 
contribudng to tiie variation of bacterial toxicity. 

Structural analysis of the iso-IS23 1 elements indicates that they are related 
to IS1151 from Clostridium perfringens and distantly related to IS4 and IS186 
from Escherichia coli. Like the other IS4 femily members, they contain a 
conserved transposase-integrase motif found in other IS fruniiies and retro^aruses. 

Moreover, frinctional data gathered from IS231A m Escherichia coii 
indicate a non-replicative mode of transposition, with a preference for specific 
targets. Similar results were also obtained in Bacillus subtilis and B, 
thuringiensis. See, e.g,, Mahillon, letaL, Genetka P5:13-26 (1994); Campbell, 
J. BacterioL 7495-7499 (1992), 

The amount of lecombinase which is added to drive the cecombmation 
reaction can be determined by u^g known assays. Specifically, titration assay 
is used to determine the appropriate amount of a purified recombmase en2yme, 
or the aj^xropriate amount of an extract 

Engineered Recombination Sites. The above recombinases and 
corresponding recombinase sites are suitable for use in recombination cloning 
according to the present inventioiL However, wild-type recombination sites 
contain sequences that reduce the efficiency or specificity of recombination 
reactions as applied in methods of the present invention. For example, multiple 
stop codons in attB, attR, attP, attL and loxP recombination sites occur in 
multiple reading fiMies on both strands, so recombination efficiencies arc 
reducted, e.g, where the coding sequence must cross the recombination sites, 
(only one reading frame is available on each strand of loxP and attB sites) or 
impossible (in attP, attR or attL); 

Accordingly, the present invention also proiddes ' engineered 
recombination ^tes that overcome these problems. For example, att sites can be 
engineered to have one or multiple mutations to enhance spedficity or efGciency 
of the recombination reaction and the properties of Produa DNAs {e.g.^ attl, att2, 
and atG sites); to decrease reverse reaction (e.g., removing PI and HI from attB). 
The testmg of these mutants deterrrunes which mutants yield sufficient 
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lecombinadonal activity to be scdtable for recombination subcloning according 
to the piesent invendon. 

Mutations can therefore be introduced into recombination sites for 
enhancing site specific recombination. Such mutations include, but are not 
limited to: recombination sites without translation stop codons that allow fusion 
proteins to be aicoded; recombination sites recognized by the same proteins but 
differing in base sequence such that they react largely or exclusively with their 
homologous partners allow multiple reactions to be conten^lated. Which 
particular reactions take place can be specified by which particular partners are 
present in tiie reaction mixture. For example, a tripartite protem fiision could be 
accomplished with parental plasmids containmg recombination rites attRl and 
atai2; attLl and atlL3; and/or attR3 and attiL2. 

There are well known i»ocedures for introducu^ ^)ecific mutations into 
nudeic add sequences. A inm&beroftiiese are described ixLAusubel,F.M.er a/.. 
Current Protocols in Molecular Biology, Wiley Intersdence^New York (1989- 
1996). Mutations can be derigned into oligonucleotides, which can be used to 
modify existing cloned sequences, or in ampUfication reactions. Random 
mutagenesis can also be en[q)loyed if q)propriate selection methods are available 
to isolate fiie desired mutant DNA or RNA. The presence of the desired 
mutations can be confirmed by sequencing tiie nucleic acid by well-known 
methods. 

The following non-limiting methods can be used to engineer a core region 
of a given recombination site to provide mutated sites suitable for use in the 
present invention: 

1. By iwjmbination of two parental DNA sequences by site-specific (e.g. 

attL and attR to give attB) ot other (e.g. homologous) 
recombination medianisms. The DNA parental DNA segments 
containing one or more base alterations resulting in the final core 
sequence; 

2. By mutation or mutagenesis (site-specific, PGR, random, spontaneous, 

etc) (firectiy of the desired core sequence; 
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3. By mutagenesis (site-specific, PGR. random, spontanteous, etc) of 

parental DNA sequaices, v*ich arc recomb'med to generate a 
desired core sequence; and 

4. By reverse transcription of an RNA encoding the desired core 

sequence. 

The functionality of the mutant recombination sites can be demonstrated 
in ways that dq)end on the particular characteristic fliat is desired, Forexanqile, 
the lack of translation stop codons in a recombination ate can be demonstrated 
by e3q)ressir^ the appropriate fusion proteins. Spedficity of recombmation 
between homologous partners can be demonstrated by introdudng the appropriate 
molecules into in vitro reactions, and assaying for recombmation products as 
described ham or known m the art Odier desired mutations in recombmation 
sites mi^t include the presence or absence of restriction sites, translation or 
transcription start ^gnals, protein binding sites, and other known functionalities 
of nucleic acid base sequences. Genetic selection schemes for particular 
fimctional attributes in the recombination sites can be used according to known 
method steps. For example, the modification of sites to provide (from a pair of 
sites that do not interact) partners that do interact could be achieved by requiring 
deletion, via recombination between the sites, of a DNA sequence encoding a 
toxic substance. Similarly, selection for sites that remove translation stop 
sequences, the presence or absence of protein bindmg sites, etc., can be easily 
devised by those skilled in die art 

Accordingly, the present invention provides a nucleic add molecule, 
comprismg at least one DNA segment havmg at least two engineered 
recombination sites flanking a Selectable marker and/or a dedred DNA segment, 
v^^erdn at least one of said recombination sites comprises a core region havmg 
at least one ec^eered mutation that enhances recombination in vitro in the 
formation of a Cointegrate DNA or a Product DNA. 

The nucleic arid molecule can have at least one mutation that confers at 
least one enhancement of said recombination, said enhancement selected from the 
group consistmg of substantially (i) favoring excisive integration; (ii) favoring 
excisive recombmation; (ii) relie>dng the requirement for host factors; (iii) 
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increasing the efficiency of said Cointegrate DNA or Product DNA formation; 
and (rv) inoeasing the specificity of said Cointegrate DNA or Product DNA 
formation. 

Hie nucleic acid molecule preferably comprises at least one 
recombination site derived fiom attB, attP, attL or attR. More preferably the att 
site is selected from attl, att2, or att3, as described herein. 

In a preferred embodiment, the core region comprises a DNA sequence 
selected fiom the group consisting of: 

(a) RKyCWGCITIWmTACNAASTSGB(m-a^^ 

(b) AGCXJWGCTTTYKTRTACNAACTSGB (m-attB) (SEQ ID 

N0:2); 

(c) GTTCAGClTranKTAa^AACTSGB (m-attR) (SEQ ID 

N0:3); 

(d) AGCCWGCTTTCKTRTACNAAGTSGB (m-attL) (SEQ ID 

N0:4); 

(e) GTTCAGCTTlTKTRTACNAAGTSGB(m^ttPl) (SEQ ID 

N0:5); 

or a corresponding or complementary DNA or RNA sequence, wherein R=A or 
G; K=G or T/U; Y=C or T/U; W=A or T/U; N-A or C or G or T/U; S=€or G; 
and B=C or G or TAJ, as presented in 37 C.F,IL §1.822, which is entirely 
incotporated herein by reference, wherein tiie core region does not contain a stop 
codon in one or more reading frames. 

The core region also preferably comprises a DNA sequence selected from 
the group consisting of: 

(a) AGa:TGCnTITITGTACAAAOTGT(attBl)(SEQIDN0:6); 

(b) AGCCTGCITTCTTGTACAAACTTGT(atlB2)(SEQIDNO:7); 

(c) A(XCAGCTTIUrTGTACAAAeTTGT(attB3) (SEQ IDN0:8); 

(d) GTTCAGCrrTTrrGTACAAACrrGT(attf l)(SEQIDNO:9): 

(e) GrrcAGCTrTCTrGTACAAACTrGT(at(Rj2)(SEQroN0 

(f) GTTCAGCnTCTtGTACAAAGTTGG (attR3) (SEQ ID 
N0:11); 
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(g) AGCCrGCrmTTGTACAAAGTTGG (attLl) (SEQ ID 
N0:12); 

(h) AGCCTGCTTTCTTGTACAAAGTTGG (attL2) (SEQ ID 
N0:13); 

5 0) ACCCAGCTTTCTTGTACAAAGTTGG (attL3) (SEQ ID 

N0:14); 

Q) GTTCAGCriTTTIGTACAA^ 

(k) GTTCAGCrrTCrrGTACAAAGTTGG (attP2,P3) (SEQ ID 
N0:16); or a corresponding or coniplemoitazy DNA or RNA sequence. 
IQ The present iilvendon thus also provides a mediod for maldng a nucleic 

acid molecule, con^nising prodding a nucldc add molecule having at least one 
engineeted recombination site compridng at least one DNA sequence havir^ at 
least 80-99% homology (or any range or vahie therein) to at least one of SEQ ID 
N0S:1-16, or any suitable recombination site, or which hybridizes under 
15 stringent conditions thereto, as known in the art 

Clearly, there aie various types and permutations of such well-known 
in vitro and in vivo selection methods, each of which are not described herein for 
the sake of bre\Tity. However, such variations and permutations are contemplated 
and considered to be the different embodiments of the present invention. 
20 It is important to note that as a result of the preferred embodiment bdi^ 

in vitro recombination reactions, non-biological molecules such as PGR products 
can be manipulated ^ tiie present recombinational cloning method. In one 
example, it is possible to clone linear molecules mto drcular vectors. 

Th^ are a number of applications fertile present inventiorL These uses 
25 mclude, but are not limited to, dian^ng vectors, a?}poshig promoters wth genes, 

constructmg genes for fusion proteins, chan^ng copy number, chang^ 
replicons, clonmg into phages, and cloning, eg., PGR products (witii an attB site 
at one end and a loxP site at the other end), goiomic DNAs, and cDNAs. 
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Tbe following examples arc intended to fiirflier illustrate certain preferred 
embodiments of the invention and are not intended to be Umitii^ in nature. 



Examples 

The present recombinadonal cloning metiiod accomplishes the exchange 
of nucleic acid segments to render somethmg useful to the user, such as a change 
of cloning vectors. These segments must be flanked on both sides by 
recombination signals that are in the proper orientation with respect to one 
another. In the examples below Ae two parental nucleic acid molecules (eg. , 
plasmids) are called the Insert Donor and the Vector Donor. The Insert Donor 
contains a segment that will become j oined to a new vector contributed by the 
Vector Donor. The recombmation intKmediate(s) that oontain(s) both starting 
molecules is called tiie Couxtegrate(s). The second recombination event produces 
two daughter molecules, called the Product (the desired new clone) and tiie 
Byproduct 

Buffers 



Various known buffers can be used in the reactions of the present 
invention. For restriction enzymes, it is advisable to use tiie buffers 
recommended by the manufacturer. Alternative buffers can be readily found in 
the literature or can be devised by those of ordinary skill in the art 

Examples 1-3. One exemplary buffer for lambda integrase is comprised 
of 50 mM Tris-HCl, at pH 7.5-7.8, 70 mM KCl, 5 mM spermidine, 0.5 mM 
EDTA, and 0,25 mg/ml bovine saum albumin, and optionally, 10% gJyceroL 

One preferred buffer for PI Cre recombinase is con^irised of 50 mM 
Tris-HCl at pH 7.5, 33 mM NaCi, 5 mM spemudine, and 0,5 mg/ml bovine 
serum albumin. 

The buffer for other site-specific recombmases wMch arc sknilar to 
lambda Int and PI Cre arc eitiier known in the art or can be determmed 
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empirically by the skilled artisans, particularly in light of tiie above-described 

Example 1: Recombinational Cloning Using Cre and Cre & Int 

Two pairs of plasmids were constructed to do the in vitro recombinational 
cloning mediod in two dififerent "ways. One pair, pEZC705 and pEZC726 
(Figine 2A), was constructed with loxP and att ^tes, io be used with Qe and 
Aintegiase. The o&a pair, p£ZC602 and pEZC6290E^igure3A), contained the 
laxP (wild type) site for Cre, and a second mutant lax site, loxP 511, which 
differs from /oxP in one base (out of 34 total). The minimum requirement for 
recomlnnational cloning of &e preset inv^on is two recombmation sites in 
eachplasmid,ingen^XandF,andA7andF. Recombinadonaldomng takes 
place if either or bodi types of site can recombme to form a Cointegrate {e.g, X 
and X'), and if either or both (but necessarily a site different from the type 
forming the Cointegrate) can recombine to excise the Product and Byproduct 
plasmids from the Comtegrate (e.g. Y and F), It is hnportant to the 
recombination sites on the same plasmid do not recombine. It was found that the 
present recombinational cloning could be done with Cre alone. 

Cre-Only 

Two plasmids were constructed to demonstrate this conception {see 
Figure 3 A). pEZC629 was the Vector Donor plasmid. It contained a constitutive 
drug marker (chloramphenicol resistance), an origin of feplicadon, laxP and 
loxP 511 sites, a conditional drug maiker (kanamycin redstance whose 
expression is controlled by the operator/promoter of the tetracycline resistance 
operon of transposon TiilO), and a constitutively expressed gene for the tet 
repressor protein, tetR. E. coll cells containing pEZC629 were resistant to 
chloramphenicol at 30 ^g^ml, but sensitive to kanamycin at 100 ^g/ml. pEZC602 
was the Insert Donor plasmid, which contained a different drug marker 



WO9e/40724 PCr/«S9(J/10082 

-33- 

(an^icmin icsistanoe), an ori^ and teP and JoxP 51 J sites flanking a multiple 
cloning dte: 

This experiment was comprised of two parts as follows: 
Parti: About 75 ng each of pEZC602 and pEZC629 were mixed in a 
total volume of 30 |il of Cre buffer (50 mM Tris-HCl pH 7.5, 33 mM NaCl, 
5 mM spennidine-HCl, 500 |ig/ml bovine serum albumin). Two 10 jil aliquots 
were transferred to new tubes. One tube iccdved 0,5 id of Qe protein (approx. 
4 units per partially purified according to Abrwnsld and Hoess, J. Biol Cheat 
259:1509 (1984)). Both tubes were incubated at 37*C for 30 minutes, then 70*0 
for 10 mnutes. Aliquots of each reaction were diluted and transformed into 
DH5a. Followi^ eaq^reasion, aliquots were plated on 30 |ig/ml 
cUoranoAemcoU 100 [xg/ni anqridlBn plus 200 iig/ml meflridllin; or 100 figtol 
kanamycin. ResuUs: 5^ Table 1. The reaction without Cre gave 1.11x10* 
an^dUin reastant colonies (fiom the Inscat Donor plasmid pEZC602); 7.8x10^ 
diloramphenicol resistant colonies (ftom the Vector Donor plasmid pEZC629); 
and 140 kanamycin resistant colonies (background). The reaction with added Cre 
gave 7.5x10^ ampicillin resistant colonies (from the Insert Donor plasmid 
pEZC602); 6. 1x1 0^ chloramphenicol resistant colonies (from the Vector Donor 
plasmid pEZC629); and 760 kanamydn resistant colonies (mixture of 
background colonies and colonies from the recombinational cloning Product 
plasmid). Analysis: Because the number of colonies on the kanamycin plates 
was much higher in the presence of Cre, many or most of them were predicted to 
contain the desired Product plasnud. 



Table 1 


Enzyme 


Ampidllln 


Chloramphenicol 


Kanamycin 


Efficiency 


None 


l.lxlO* 


7,8x10* 


140 


140/7.8x10^=0.02% 


Cre 


7.5x10' 


6.1x10* 


760 


760/6.1x10^=0.12% 



Part 11: Twenty four colonies from the **+ Cre" kanamycin plates were 
picked and inoculated into medium containing 100 ng/ml kanamycin. Minipreps 
were done, and the muiiprep DNAs, uncut or cut with Smal or Hindlll, were 
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clortrophoiesed. Results: 19 of tiie 24 minipreps showed supercoiled plasmid 
of the size predicted for die Product plasmid Ail 19 showed the predicted Smal 
and Hitidm restriction fiagments. Analysis: The Cre only scheme was 
demonstrated SpecificaDy, it was detamined to have yielded about 70% (19 of 
24) Product clones. The cfiBciency was about 0.1% (760 kanamycin resistant 
clones resulted from 6.1x10^ d)ioranq)henicol redstant colonies). 

Cre Pius Iniegrase 



The plasmids used to demonstrate tius method are exacdy analogous to 
those used above, except that pEZC726, the Vector Donor pla^d, contained an 
attP site in place of ibxP 511, and pEZC70S, the Insert Donor plasmid, contained 
an ottB site in place of loxP 511 (Figure 2A). 

This e:q)eriment was comprised of three parts as follows: 
Fart I: About 500 ng of pEZC705 (the Insert Donor plasmid) was cut 
with SccH^ which linearized the plasmid within the ampicillin resistance gene. 
(This was done because the A. integrase reaction has been historically done with 
the OttB plasmid in a linear state (H. Nash, personal communication). However, 
it was found later that the integrase reaction proceeds well with both plasmids 
siqjercoiled) Then, the linear plasmid was ethanol precipitated and dissolved in 
20 nl of X mtegrase buffer (50 mM Tris-HCI, about pH 7.8, 70 mM KCl, 5 mM 
sp^xni(fine-HQ, 0.5 mM EDTA, 250 ^gAnl bovine serum albumm). Also, about 
500 ng of the Vector Donor plasmid pEZC726 was e&anol predpitated and 
dissolved in 20 (il 1 integrase buffer. Just before use, X mtegrase (2 (il, 
393 )ig/ml) was thawed and diluted by adding 18 |il cold k integrase buffer. 
One \il IHF (integration host &ctor, 2.4 mg/ml, an accessory protem) was diluted 
into 150 111 cold X integrase buffer. Aliquots (2 fil) of each DNA were mixed 
with A, integrase buffer, with or without I \i\ each X integrase and IHF, in a total 
of 10 The mixture was mcubated at 25*C for 45 minutes, then at 70"C for 
10 minutes. Half of each reaction was applied to an agarose gel. Results: In the 
presence of integrase and IHF, about 5% of the total DNA was converted to a 
linear Cointegrate form. Analysis: Activity of integrase and IHF was confumed. 
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Partll: ThiwinicroUtei^ of each reaction (tc.,\wtii or wtho 
and IHF) were diluted into 27 ^il of Cre bufifer (above), then each reaction was 
split into two 10 \il aliquots (four altogether). To two of these reactions, 0.5 \il 
of Cre piotein (above) were added, and all reactions were incubated at 3TC for 
30 minutes, thai at 70'C for 10 minutes. TE bufifer (90 TE: 10 mM Tris-HCl, 
pH 7.5, 1 mM EDTA) was added to each reaction, and 1 eadi was transfonned 
into K coli DH5a. The transformation mixtures were plated on 100 |ig/nd 
lonpicillin plus 200 ^g/ml meducilUn; 30 ^g/ml chloramphenicol; or 100 ^g/ml 
kanamycin. ResuUs: See Table 2. 



Table 2 


Enzyme 


AmpiciUin 


Chloramphenicol 


Kanamycin 


EfBciency 


None 


990 


20000 


4 


4/ 2x10* =0.02% 


Cre only 


280 


3640 


0 


0 


hitegrase* 
only 


1040 


27000 


9 


9/ 2,7x10* = 0.03% 


Integrasc* 
1 +Cre 


110 


1110 


76 


76/l.lxl0'«6.9% 



* Integtase reactions also contained IHF. 



Analysis: The Cre piotein inqi^red transfonnation. When adjusted for 
tins efifect, die number of kanamycin resistant colomes, compared to the control 
reactions, inoeased more lSam 1 00 fold when botii Cre and Integrase were used. 
This suggests a specificity of greater than 99%. 

Part JH: 38 colonies were picked from the Integrase plus Cie plates, 
miniprq) DNAs were made and cut with f/wdlll to give diagnostic mapping 
information. Result: All 38 had precisely the ejqjected fragment sizes. 
Analysis: The Cre plus A. integrase method was observed to have much higher 
specificity than Cre-alone. Conclusion: The Cre plus X integrase method was 
demonstrated. Efficiency and specificity were much higher than for Cre only. 
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Exan^U 2: Using in vitro Recombinational Cloning to Subclone the 
Chloranqfhenicol Acetyl Transferase Gene into a Vector for Expression in 
Eukttryotic Cells (Figure 4A) 

An Insert Donor plasmid, pEZC843, was constructed, comprising the 
ddoramphenicol acetyl transferase gene of £. co//, cloned between loxP and attB 
sites such that the loxP site was positioned at 4ie 5'-end of the gene (Figure 4B). 
A Vector Donor plasmid, pEZC1003, was constructed, which contained the 
cytomegalovirus eukaryotic promoter apposed to a loxP site (Figure 4C). One 
microliter aliquots of each supercoiled plasmid (about 50 ng crude minipiep 
DNA) were combined in a ten microliter reaction containing equal parts of 
lambda integrase buffer (50 mM Tris-HCI, pH 7.8, 70 mM KCl, 5 mM 
spennidinc, 0.5 mM EDTA, 0.25 mg/ml bovine serum albumin) and Cre 
recombinasebuff«-(50mMTris-HCl,pH7.5,33niMNaCl, 5 mM spcnmdine, 
0.5 mgM bowie serum albumin), two umts of Cre lecombinase, 16 ng 
int^tion host fector, and 32 ng lambda integrase. After incubadon at 30''C for 
30 minutes and 75*C for 10 minutes, one microliter was transfonncd into 
competent £. coli strain DH5a (Life Technologies, Inc.). Aliquots of 
transformations were spread on agar plates containing 200 ng/ml kanamycin and 
incubated at 37'C overnight. An otherwise identical control reaction contained 
the Vector Donor plasmid only. The plate receiving 10% of the control reaction 
transformation gave one colony; the plate receiving 10% of the recombinational 
clomng reaction gave 144 colonies. These numbers suggested that greater than 
99% of the recombinational cloning colonies contained the desired product 
plasmid. Miniprep DNA made from six recombinational cloning colonies gave 
the predicted size plasmid (5026 base pmrs), CMVProd. Restriction digestion 
witiiifcol g3cve Ae fragments predicted for the diloranq)hOTicol ace^l transferase 
cloned downstream of the CMV promoter for all six plasmids. 
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Subdoned DNA Segments Flanked by attB Sites WUhout Stop 

Parti: Background 

The above examples aie suitable for transcriptional fiisions, in which 
transcripdon crosses recombination sites. However, both attR and loxP sites 
contain multiple stop codons on both strands, so tianslational fusions can be 
difiGcult, vAascQ tiie coding sequence must cross the recombination sites, (only one 
reading £tame is available on eadi strand of loxP sites) or in^)ossible (in attR or 
attL). 

A prindpal xeason for subclomng is to fiise protein domains. For 
example, fusion of the glutathione S-transferase (GST) domam to a protein of 
interest allows the fusion protein to be purified by affinity chromatography on 
glutathione agarose (Pharmacia, Inc., 1995 catalog). If the protem of interest is 
fused to runs of consecutive histidines (for example His6), the fusion protein can 
be purified by affini^ diromatography on chelating resins containing metal ions 
(Qiagen, Inc.). It is often desuable to compare amino terminal and carboxy 
terminal fusions for activity, solubility, stability, and the like. 

The attB sites of the bacteriophage X integration system were examined 
as an alternative to loxP sites, because they are small (25 bp) and have some 
sequence flexibility (Nash, HA. et aL, Proc Natl Acad Set USA 5-/:4049-4053 
(1987). It was not previously suggested that multiple mutations to remove all 
stop codes would result in useful recombination sites for recombinational 
subclonmg. 

Using standard nomoiclature for site specific recombination in lambda 
bactmophage (Weisber, in Lambda lU, Hendrix, et al , cds.. Cold Spring Harbor 
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Laboialoiy, Cold Spring Haibor, NY (1989)), the nucleotide R^ons that 
participate in fte recombination reaction in an £ co// host cell are represented 
as follows: 

attP —PI- .HI- -P2— X- -H2 - -C-O-C- -H'— P'l— P'2— P*3 — 

+ 

attB --B-O-B* 

lat, IHP It Xis, mt, IHF 

attR — P1--H1— P2— X— H2— C-O-B* — 

+ 

attL — B-O-C— H'— P«l— P"2— P'3— , 



where: O represents the 15 bp core DNA sequence found in both the phage and 
E. coli genomes; B and B' represent approximately 5 bases adjacent to the core 
in the E coli genome; and PI, HI, P2, X, H2, C, C, IT, Fl, P'2, and P3 represent 
known DNA sequences encoding protein binding domains in the bacteriophage 
X genome. 

The reaction is reversible in the presence of the protein Xis (excisionase); 
recombination between attL and attR precisely excise the A, genome from its 
integrated slate, regenerating tiie drcular \ genome containing attP and the linear 
K coli genome containing attB. 

Pdrtll: Cottstructwn and Testing ofPlasmds Containing Mutants 

Mutant attL and attR sites were constructed. Importantly, Landy et al, 
{Ann. Rev, Biochem. 5^:913 (1989)) observed that deletion of the PI and HI 
domains of attP faciUtated the excision reaction and eliminated the integration 
reaction, tiiereby makmg tiie excision reaction irreversible. Therefore, as 
mutations were introduced in attR, tiie PI and HI domains were also deleted. 
attR sites in tiie present example lack flie PI and HI regions and have tiie Ndel 
site removed (base 27630 changed from C to G), and contain sequences 
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corresponding to bacteriophage X coordinates 27619-27738 (GenBank leiease 
92.0, bg:LAMCG, '^Complete Sequence of Bacteriophage Lambda"). 

Hie sequence of attB produced by recombination of wild type attL and 
attRsitesis: 



attBwt: 5« AGCXTT GCTTTTTTATACrAA CTTGA 3* (SEQ. ID 

£10:31) 

3* TCX3GA CX3RAA AAAm TCATT GAACT 5* • 



The stop codons are italicized and undCTlined Note that sequences of attL, attR, 
and attP can be derived from the attB sequence and the boimdaries of 
bacteriophage X contained within attL and attR (coordinates 27619 to 27818). 

When mutant attRl and attLl sites were recombined the sequence attBl 
was produced (mutations in bold, large font): 

B o B« 

attBl: 5' MCCT <3ctTTTTtGtACAaA CTTgT 3 ^^EQ. ID NO:6) 

3' TCGGA cGAAAAAaCaTgTtT GAACA 5' 

Note that the four stop codons are gone. 

When an additional mutation was introduced in the attRl and attLl 
sequences (bold), attR2 and attL2 sites resulted Recombination of attR2 and 
attL2 produced the attB2 site: 

B o B' 

attB2: 5' AGCCT gcTTtCtTGTACAAA ^""^"^ (SEQ. ID N0:7) 

3' TCGGA cGRAAGaACATGTTT 5' 



The recombination activities of the above attL and attR sites were assayed 
as follows. The attB site of plasmid pEZC705 (Figure 2B) was replaced with 
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attLwt,attLl, orattL2- The attP site of plasmid pEZC726 (Figure 2C) was 
replaced with attRwt (lacldiig regions PI and HI), attRl, or attR2. Thus, the 
resulting plasmids could recpmbine via their loxP ates, mediated by Qe, and via 
their attR and attL sites, mediated by Int,Xis, and IHF, Pans of plasmids were ■ 
5 mixed and reacted with Cre, Int, Xis, and IHF, transformed into K coli competent 

cells, and plated on agar containing kanamycin. The results arc presented in 
Table 3: 



Tables 





Vector donor att site 


Gene donor att site 


# of kanamycin resistant colonies* 


10 


attRwt (pEZC1301) 


None 


1 (background) 






attLwt(pEZC1313) 


147 






attLl (pEZC1317) 


47 




ti 


attL2(pEZCl321) 


0 




attRl (pEZC1305) 


None 


1 (badcgiound) 


15 


n 


atflL.wt(pEZCl313) 


4 




« 


attLl (pEZC13l7) 


128 






attL2(pEZC1321) 


0 




attRl (pE2X:i309) 


None 


9 (background) 




M 


attLwt(pEZC13l3) 


0 


20 


ft 


attL2(pEZC1317) 


0 




II 


attL2(pEZC1321) 


209 



(* 1% of each transformation was spread on a kanamycin plate.) 



The above data show that whereas the wild type att and attl sites 
recombine to a small extent, the attl and att2 sites do not recombine detectably 

25 with each other. 

PartlU, Recombination was demonstrated \rfientiiie core region of both 
attBdtes flanking the DNA segment of intoest did not contain stop dodons. The 
physical state of the participadng plasmids was discovered to influence 
recombination efficiency. 

30 The appropriate att sites were moved into pEZC705 and pEZC726 to 

make the plasmids pEZC1405 (Figure 5G) (attRl and attR2) and pEZC1502 
(Figure 5H) (atfl^l and attL2). Hie desired DNA segment in this experiment was 
a copy of the chloramphenicol resistance gene cloned between the two attL sites 
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of pEZC1502. Paiis of plasmids were lecombincd in vitro using Int, Xis, and 
IHF (no Qe because no loxP sites were present). The yield of desired kanamycin 
lesistant colonies was d^nnined v^en both parental plasmids were circular, or 
when one plasmid was circular and the other linear as presented in Table 4: 



Table 4 



Vector donor* 


Gene donor' 


Kanamycin resistant colonies^ 


Circular pEZCHOS 
Circular pEZC1405 
Linear pEZCMOS 
Linear pEZC1405 
Circular pEZC1405 


None 

Circular p£ZClS02 
None 

Circular pEZClS02 
Linear pEZClS02 


30 
2680 
. 90 
172000 
73000 



' DNAs were purified wfli QlagexL cohmms, concentrations determined by 
A260, and linearized with Xba 1 0>EZC1405) or AlwN I (pEZC1502). Each 
reaction contained 100 ng of the indicated DNA. All reactions (10 |il total) 
contained 3 nl of enzyme mix pCis, Int, and IHF). After incubation (45 minutes 
at 25 °, 10 minutes at 65 *), one fil was used to transfonn R coli DH5a cells. 
^Number of colonies expected if the entire transforation reaction (1 ml) had been 
plated. Either 100 |il or 1 |il of the transformations were actually plated. 

Analysis: Recombinational cloning u^ng mutant attR and attL sites was 
confirmed. The deared DNA segment is subcloned between attB sites that do not 
contain any stop codons in either strand. The enhanced yield of Product DNA 
(when one parent vm Imear) was unexpected because of earlier observations that 
file excision reaction was more efficient when both participatir^ molecules were 
si^)ercoiled and proteins were limiting (Nunes-Ehiby et al,. Cell 50:779-788 
(1987), 



Example 4: Demonstration of Recombinational Cloning Without Inverted 
R^eats 



Parti: Rationale 



The above Example 3 showed that plasmids contdning inverted repeats 
of the appropriate recombination sites (for example, attLl and attL2 in plasmid 
pEZC1502) (Figure 5H) could recombine to give the desired DNA segment 
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flanked by attB sites without stop codons, also in inverted orientation. A concern 
was the in vivo and in vitro influence of the inverted repeats. For example, 
transaiption of a desired DNA segment flanked by attB sites in inverted 
orientation could yield a single stranded RNA molecule that might form a haiq}in 
structure, thereby inhibiting translation. 

Inverted orientation of similar recombination sites can be avoided by 
pladng the sites in direct repeat aiiangementatt sites. If parental plasmids each 
have a wild type atO, and wild ^ atffl site, in du«ct rq)eat tiie In^ 
prdtdns will simply ronove the DNA segm^t flanked by fliose sites in an 
intramolecular reaction. However, the mutant sites described in the above 
Example 3 suggested fliat it ought be possible to inhibit flie intramolecular 
reaction vMlc allowing the intmnolecular recombination to proceed as desired. 

Partll: Structure of PlasnddsWUhout Inverted Rf^eats for Recombin^ 
Cloning 

Hie attR2 sequence in plasmid pEZC1405 (Figure 5G) was replaced with 
attL2, m the opposite orientation, to make pEZC1603 (Figure 6A), The attL2 
sequence of pEZC1502 figure 5H) was replaced with atffi2, in flie opposite 
orientation, to make pEZC1706 (Figure 6B). Each of these plasmids contained 
mutations in the core region that make intramolecular reactions between attl and 
atf2 cores v^ inefficient (see Example 3, above). 

Plasmids pEZC1405, pEZC1502, pEZC1603 and pEZC1706 woe 
purified on Qiagen columns ((^en. Inc.). Aliquots of plasmids pEZC1405 and 
pEZC1603 were linearized with Xba 1. Aliquots of plasmids pEZC 1502 and 
pEZC1706 were linearized with AlwN L One hundred ng of plasmids w^e 
mixed in buffer (equal volumes of 50 mM Tris HCl pH 7.5, 25 mM Tris HCI pH 
8.0, 70 mM KCl, 5 mM spermidine, 0.5 mM EDTA, 250fig/ml BSA, 10% 
glycerol) containing Int (43.5 ng), Xis (4.3 ng) and IHF (8.1 ng) in a final volume 
of 10 nl. Reactions were incubated for 45 minutes at 25 "C, 10 minutes at 65*'C, 
and 1 nl was transformed into E. coli DH5a. After expression, aliquots were 
spread on agar plates containing 200 pig/ml kanamycin and incubated at 37*'C. 
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Results, expressed as the number of colonies per 1 \i\ of xecombination 
xeaction are presented in Table 5 : 



Tables 



5 



10 



Vector Donor 


Gene Donor 


Colonies 


Predicted % product 


Circular 1405 




100 




Circular 1405 


Circular 1502 


3740 


3640/3740 = 97% 


Linear 1405 




90 




Linear 1405 


Circular 1502 


172,000 


171,910/172,000 = 99,9% 


Circular 1405 


Linear 1502 


73,000 


72,900/73,000 = 99.9% 










Circular 1603 




80 




Circular 1603 


Circular 1706 


410 


330/410-80% 


Linear 1603 




270 




Linear 1603 


Circular 1706 


7000 


6730/7000 = 96% 


Circular 1603 


Unear 1706 


10,800 


10,530/10,800=97% 



15 Analysis. In ail configurations, i.e., circular or linear, the pEZCl 405 x 

p£ZCl502 pair (vdlh att sites in inverted r^eat configuration) was more efficient 
than p£ZC1603 x pEZC1706 pair (with att sites mutated to avoid luuipin 
formation). The pEZC1603 x pEZC1706 pair gave higher backgrounds and 
lower efficiencies than the pEZC1405 x pEZC1502 pair. While less efficient, 

20 80% or more of the colonies from the pEZC1603 x pEZC1706 reactions were 

expected to contain the desired plasmid product Making one partner linear 
stimulated the reactions in all cases. 

Part III: Confirmation of Product Plasndds ' Structure 

Six colonies each from the linear pEZC1405 (Figure 5G) x circular 
25 pEZCl502 (Figure 5H), circdar p£ZC1405 x linear pEZC1502, linear 

pEZC1603 (Figure 6A) x circular pEZCl706 (Figure 6B), and circular 
pEZC1603 X linear pEZC1706 reactions were picked into rich medium and 
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mmqjrepDNAs were prepared. Diagnostic cuts with Ssp I gave the predicted 
lestriction fragments for all 24 colonies. 

Analysis. Recombination reactions between plasmids with mutant attL 
and attR sites on the same molecules gave the desired plasmid products with a 
5 high degree of specificity. 

Example 5: Recontbinaiional Cloning with a T&xic Gene 
Pari I: Background 

Restriction enzyme Dpn I recognizes the sequence GATC and cuts tiiat 
sequoice only if die A is methylated by die dam metfaylase. Most commonly 
10 used E. coll strains are dam^ E}^]:ession of Dpn I in dam* strains of £. coli is 

lethal because the diromosome of the cell is chopped into many pieces. 
However, in dam" cells expression of Dpn 1 is innocuous because the 
chromosome is immune to Dpn I cutting. 

In the general recombinational cloning scheme, in which the vector donor 
15 contains two segments C and D separated by recombination sites, selection for 

the desired product depends upon selection for the presence of segment D, and 
the absence of segment C. In the original Example segment D contsuned a drug 
resistance gene (Km) that was negatively controlled by a rq)ressor gene found on 
segment C. When C was present, cells containing D were not resistant to 
20 kanamycm because the resistance gene was turned off. 

The Dpn I gene is an example of a tosdc gene that can replace the 
repressor gene of die .above anbodiment If segment C expresses the Dpn I gene 
product, transforming plasmid CD mto a dam* host kills die cell. If segm^t D 
is transferred to a new piasnud, for example by recombinational cloiung, then 
25 selecting for the drug marker will be successfid because the toxic g^e is no 

longer present 
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Fartn: Constructidtt df a Vector nonor Using Dpn I as a Tasdc Gene 

The gene encoding Dpn I endonuclease was amplified by PGR using 
primers 5'CCA CCA CAA ACG CGT CCA TGG AAT TAC ACT TTA ATT 
TAGS' (SEQ. ID NO: 17) and 5*CCA CCA CAA GTC GAC GCA TGC CGA 
CAG CCT TCC AAA TGT3' (SEQ. ID N0:18) and a plasnud containing the Dpn 
I gene (derived ftom plasmids obtamed ftom Sanfoid A. Lades, BtooldiaYen 
National Laboratory, Upton, New York; also avwlable fiom American Type 
Culture Collection as ATCC 67494) as the template. 

Additional mutations were introduced into the B and B' re^ons of attL 
and atffi, respectively, by amplifsfing ejdsting attL and atlR domains witii primers 
contaiiung tiie deaied base changes. Recombmation of tiie mutant atfl.3 (made 
wifli oligo XisllS) and attR3 (made with oligo Xisll2) yielded attB3 with the 
following sequence (differences firom attBl in bold): 



B OB* 
ACCCA GCTTTCTTGTACAAA GtGGT (SEQ. ID NO: 8) 

tGggT cgaaaGaacatgttt CaCca 

The attL3 sequence was cloned m place of attL2 of an existing Gene Donor 
plasmid to give the plasmid pEZC2901 (Figure 7A). The atlR3 sequence was 
cloned m place of attR2 m an existing Vector Donor plasnud to give plasmid 
pEZ0913 (Figure 7B) Dpn I gene was cloned into plasmid pEa:2913 to replace 
tiie tet repressor gene. The resulting Vector Donor plasmid was named 
pEZC3101 (Figure 7C), When pEZC310l was transformed mto tiie dam' stram 
SCSllO(Stratagene),hundredsofcoloniesresulted. When the same plasmid was 
transformed into tiie dam+ strain DH5a, only one colony was produced, even 
tiiough tiie DH5a cells were about 20 fold more competent tiian the SCSI 10 
cells. When a related plasmid tiiat did not contam tiie Dpn I gene was 
transformed into the same two cell lines, 28 colonies were produced from tiie 
SCSI 10 cells, while 448 colonies resulted fiom tiie DH5a cells. This is evidence 



,™ ^.An»^A PCT/US96/10082 
WO 96/40724 

-46- 

that the Dpn I gene is being C3q)ressed on plasmid pEZC3101 (Figure 7C), and 
fliat it is killing the dam* DH5a cells but not the dam- SCS 110 cells. 



PartJU: DenwnstraihnofRecontbinaihnaiOoningUsin^ 

A pair of plasmids was used to demonstrate reoombinational cloning wfli 
5 selection for product dependent upon the toMC gene Dpnl. Plasmid pEZC3 101 

figure IQ was linearized wifli Mlu I and reacted wifli circular plasmid 
pEZC2901 (Figure 7A), A second pdr of plasmids using selection based on 
control of drug resistance by a repressor gwie was used as a control: plasmid 
pEZC1802 figure 7D) was linearized with Xba I and reacted wth dicular 
10 plasmid pEZClSQZ CFigurc 5H). Eigjit microliter reactions containing the same 

buflFcr and proteins 3Gs, Int, and IHF as m previous examples were incubated for 
45 mnrates at IS^C, tiien 10 nmnites at 75*C, and 1 fA aliquots weae transformed 
into DH5a (Le., dam^-) corrq)etent cells, as presented in Table 6. 



Table 6 



Reaction # 


Vector donor 


Basis of selection 


Gene donor 


Colonies 


1 


pEZC3101/Mlu 


Dpn I toxicity 




3 . 


2 


pEZC310l/Mlu 


Dpn I toxicity 


Circular pEZC290l 


4000 


3 


pEZCl802/Xba 


Tet repressor 




0 


4 


pEZC1802«ba 


Tet repressor 


Circular pEZC1502 


12100 



20 Miniprq) DNAs were prepared fiom four colonies fix)m reaction #2 , and 

cut wth restriction enzyme SspL All gave the predicted fiagments. 

Analysis: Subcloning using selection with -a tone gene was 
demonstrated. Plasmids ofthepre<hcted structure were produced. 
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Exantple 6: Cloning of Genes with UracUDNA Glycosylase and Subdoning 
of the Genes with Kecombinationai Ooning to Make Fusion Proteins 



Part I: Converting an Ebdsting Expression Vector to a Vector Donor for 
Recombinational Ooning 

5 A cassette useM for ooaverting existing vectors into functional Vector 

Donotswasinadeasfonows. PlasmidpEZC3l01 (Figure 7Q was digested witii 
Apa I and Kpn I, treated with T4 DNA polymerase and dNIPs to render the ends 
blunt, further digested with Sma I, Hpa I, and AlwN I to render the undesirable 
DNA fiagments small, and the 2 .6 kb cassette contaimng the attRl - Cm^ - Dpn I 
10 . attR-3domaim was gel purified The concentisrtion of tiie purified ca^ 

estimated to be about 75 ng DNA/jd. 

Plasmid pGEX-2TK: (Figure 8 A) (Pharmacia) allows fusions between the 
protein glutathione S transferase and any second coding sequence that can be 
inserted in its multiple cloning site, pGEX-2TK DNA was digested witii Sma I 
15 aiid treated with alkaline phosphatase. About 75 ngofthe above purified DNA 

cassette was Ugated vnAi about 100 ng of &e pGEX-2TK vector for 2.5 hours in 
a 5 id ligation, thai 1 |d was transformed into conqwtent BRL 3 05 6 cells (a dam- 
derivative of DHIOB; dam* strams commercially avsdlable uiclude DM1 fiom 
Life Technologies, Inc., and SCS 110 ftom Stratagene). AHquots of the 
20 transformation mixture were plated on LB agar contdiung 100 |ig/ml ampicillm 

(resistance gene present on pGEX-2TK) and 30 jig/ml chloramphenicol 
(resistance gene present on the DNA cassette). Colonies were picked and 
imniprcp DNAs were made. The orientation of the cassette in pGEX-2TK was 
determined by diagnostic cuts with EcoR 1. A plasmid witii the desired 
25 orientation was named pEZC3501 (Figure 8B). 



wo 96/40724 



-48- 



PCTAJS96/10082 



Fan U: Cloning Reporter Genes Into an Recombinational Cloning Gene 
Donor Flasmid in Three Reading Frames 

Uztunl DNA giycosylase (UDG) cloning is a method for cloning PGR 
aiiq)lificadon products into dealing vectors (U.S. patent No. 5,334,5 15, entirely 
incoiporated herein reference). Briefly, PGR ampUficadon of the desired 
DNA segment is performed with primets diat contain uracil bases in place of 
thymidmebasesintiieirS'ends. When such PGR products are incubated with the 
enzyme UDG, the uracil bases are specifically removed. The loss of these bases 
weakens base pairing in the ends of the PGR product DNA, and -when incubated 
at a suitable temperature (e.g«, 37°G), the ends of such products are largely single 
stranded. If such incubations arc done in the presence of linear cloning vectors 
contaming protrudii^ 3* tails that are complementaiy to the 3* ends of the PGR 
products, base pairing efSciently anneals the PGR products to the cloning vector. 
When the annealed product is introduced into £. coU cells by transformadon, in 
vivo processes effidentiy convert it into a recombinant plasmid. 

UDG clorui^ vectors that enable donuig of any PGR product in all three 
reading fiames were prepared ftom pEZG3201 (Figure 8K) as follows. Eight 
oligonudedtides wkc obtamed from Life Technologies, Inc. (all written 5' -* 3*: 
rfl top (GGGG GAT TAG GAT ATG GGA AGG AGG GAA AAG GTG TAT 
TTT GAG GGT) (SEQ. ID N0:19), rfl bottom (GAG GTT TTG GGT GGT 
TGG GAT ATG GTA ATCXSEQ. ID NO:20), rC top (GGCCA GAT TAG 
GAT ATG GGA AGG AGG GAA AAG GTG TAT TTT GAG GGT)(SEQ. ID 
N0:21), rfl bottom (GAG GTT TTG GGT GGT TGG GAT ATG GTA 
ATG TXSEQ. ID NO:22), rf3 top (GGCCAA GAT TAG GAT ATG CCA ACG 
AGG GAA AAG GTG TAT TTT GAG GGT)(SEQ. ID NO:23), rf3 bottom 
(GAG GTT TTG GGT GGT TGG GAT ATG GTA ATG TT)(SEQ, ID NO:24), 
carboiy top (AGG GTT TAG GTG GAC)(SEQ. ID NO:25) and carboxy 
bottom CTGGA GTG GAG GTA AAG GGT TGG GAG TTA TTA)(SEQ. ID 
NO:26). The rfl, 2, and 3 top strands and the carboxy bottom strand were 
pho3phorylated on thek 5' ends with T4 polynucleotide Idnase, and then the 
complementary strands of each pair were hybridized. Plasnud pEZG3201 
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(Figuie 8K) was cut wlhNotI and Sal I, and aKquots of cut plasmid were mixed 
wi4i tibe caiboxy-oligo duplex (Sal I end) and either the rfl , rf2, or rf3 duplexes 
(Not I ends) (10 ng cut plasmid (about 5 pmol) mixed with 250 pmol carboxy 
oligo duplex, split into toee 20 jil volumes, added 5 ^il (250 pmol) of ifl, rf2, or 
tQ diq)lex and 2 jil = 2 units T4 DNA Ugase to each reaction). After 90 nunutes 
of ligation at room temperature, each reaction was applied to a preparative 
agarose gel and the 2.1 kb vector bands were eluted and dissolved in 50 |il of TE. 

Part HI: PCR ofCATandphoA Genes 

Primers were obtmned from Life Technologies, Inc., to amplify the 
diloramphenicol acetyl transferase (CAT) gene from plasmid pACYC184, and 
phoA, the alkaline phosphatase gene from E. colL The primers had 12-base 5* 
ext^ioQS contBinii^ uracil bases, so that treatment of PCR products with uracil 
DNA glycosylase (UE>G) would weaken base psdring at each end of the DNAs 
and allow tiie 3' strands to anneal with the protruding 3* ends of the rfl, 2, and 3 
vectors described above. The sequences of the primers (all written 5* - 3') 
were: CATleft,UAUUlJUCAGGGU ATGGAG AAAAAA ATC ACTGGA 
TAT ACC (SEQ. ID NO:27); CAT right, UCC CAC UUA UUA CGC CCC 
GCC CTG CCA CTC ATC (SEQ. ID NO:28); phoA left, UAU UUU CAG GGU 
ATG CCT GTT CTG GAA AAC CGG (SEQ. ID NO;29); and phoA right, UCC 
CAC UUA UUA TTT CAG CCC CAG GGC GGC TTT C (SEQ. ID NO:30). 
The primers were then used for PCR reactions using known method steps {see, 
e.g., U.S. patent No. 5,334,515, entirely incorporated herein by reference), and 
the polymerase chain reaction amplification products obtained witii these primers 
comprised tiie CAT or phoA genes witii the initiating ATGs but without any 
transcriptional signals. In addition, the uradl-containing sequences on the amino 
termird encoded die cleavage site for TEY protease CLife Technologies, Inc.), and 
ttiose on die carboxy temunal encoded consecutive TAA nonsense codons. 

Unpurified PCR products (about 30 ng) were mbced witii tiie gel purified, 
linear rfl, r£2, or rG cloning vectors (about 50 ng) in a 1 0 ^1 reaction containing 
IX REact 4 buffer (LTI) and 1 unit UDG (LTI). After 30 minutes at 37*C, 1 jil 
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aliquots of each reaction were transformed into competent E. coli DH5a cells 
(LH) and plated on agar containing 50 fig/ml kanamycin. G)lonies were picked 
and analysis of miniprcp DN A showed that the CAT gene had been cloned in 
reading frame 1 0)EZC3601XFiguie 8C), reading frame 2 (pEZC3609XFigure 
5 8D) and reading frame 3 (pEZC3617)(Figure 8E), and that the phoA gene had 

been cloned in reading frame 1 CpEZC3606)^iguie 8F), rcadmg frame 2 
(pEZC3613XFigure 8G) and reading frame 3 (pEZC3621)(Figure SHQ. 

PartW: SubdoiungofCATorpkoAfrom VDG Ooning Vectors into a GST 
Fusion Vector 



10 Fiasmids encoding fusions between GST and either CAT or phoA in all 

three reading frames were constructed by recombinational cloning as follows. 
Miniprep DNA of GST vector donor pEZC3501(Figure 8B) (derived from 
Pharmacia plasmid pG£X-2TK as described above) was linearized with Cla I. 
About 5 ng of vector donor were mixed with about 1 0 ng eadi of the appropriate 

1 5 drcular gene donor vectors contmning CAT or pho A in 8 fil reactions contmning 

buffer and recombination proteins Int,Xis, and IHF (above). After incubation, 
1 ^1 of each reaction was transformed into £. coli strain DH5a and plated on 
. ampidllin, as presented in Table 7. 



Table 7 



DNA 


Colonies (10% of each transformation) 


Linear vector donor (pEZC3501/Cla) 


0 


Vector donor + CAT rfl 


no 


Vector donor + CAT rfZ 


71 


Vector donor + CAT rG 


148 


Vector donor + phqA rfl 


121 


Vector donor + phoA rf2 


128 


Vector donor + phoA rO 


31 
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PartV: Ej^ression of Fusion Protons 



Two colonies from each transfonnation were picked into 2 ml of rich 
medium (CircleGiow, BiolOl Inc.) in 17 x 100 nmi plastic tubes (Falcon 2059, 
Becton Dickinson) containing 100 jig/ml ampicillin and shaken vigorously for 
about 4 hours at 37 **C, at which time the cultures were vbibly turbid. One ml of 
each culture was transferred to a new tube containing 10 ^1 of 10% (w/v) IPTG 
to induce expression of GST. After 2 hours additional incubadon, all cultures had 
about the same turbidity; die A600 of one culture was 1 Cells from 0.35 ml 
each culture were harvested and treated with sample buffer (containing SDS and 
P-mercaptoethanol) and aUquots equivalent to about 0.15 A600 uruts of cells 
were lulled to a Novex 4-20% gradient polyaci^damide gel Followmg 
electrophoresis the gel was stained vnih Copmas^e blue. 

Results: Enhanced ejq>ression of single protein bands was seen for all 12 
cultures. Ihe observed sizes of Aese proteins correlated well with die sizes 
predicted for GST being fused (through adB recombination sites without stop 
codons) to CAT or phoA in three reading frames: CAT rfl = 269 amino acids; 
CAT rf2 = 303 amino acids; CAT rO = 478 amino acids; phoA rfl = 282 amino 
acids; pho A rf2 = 280 amino acids; and phoA rO = 705 amino acids. 

Analysis: Both CAT and phoA genes were subcloned into a GST fusion 
vector in all three reading frames, and expression of the six frision proteins was 
demonstrated. 

While the foregoing invention has been described in some detail for 
purposes of clarity and understanding, it will be appreciated by one skilled in the 
art from a reading of this disclosure that various changes in form and detail can 
be made without departing from die true scope of the invention and appended 
claims. All patents and publications cited herein are entirely incorporated herein 
by reference. 
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1. A Vector Donor DNA molecule conpidiig a first DNA sclent 
and a second DNA s^ait, said first or second DNA SQgm^t containing at least 
one Selectable marker, ^/rfierein flie first and second segments are sqsarated either 
by,(i) in a circular Vector Donor, a first and a second recombination site, or 
Cii) in a linear Vector Donor, at least a first recombination site, wherein eadi pair 
of flanking recombination sites are engineered and do not rccombine with eadi 
other. 

2. A Vector Donor DNA molecule according to claim 1, wherein flie 

Selecteble marker is at least one DNA segment selected from the group consi^ 
of: 

ffl a DNA segment that encodes a product that provides 

resistance a^inst otherwise toxic compounds; 
0i) a DNA segment that racodes a product that is otherwise 

lacMng m the redpient cell; 
(iii) a DNA segment that encodes a product that suppresses the 

activity of a gene product; 
Ov) a DNA segment that encodes a product that can be readily 

identified; 

(v) a DNA s^ent that wicodes a product that is detrimental 
to cell survival and/or fimction; 

(vi) a DNA segment fliat inhibits fte activity of any of the 
DNA segments of OKv) above ; 

(vii) a DNA segment that bmds a product that "modifies a 

substrate; 

(viii) a DNA segment that provides for isolation of a desired 
molecule; and 

(ix) a DNA segment that encodes a specific nucleotide 
sequence which can be otherwise non-fimctional; and 
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(x) a DNA segment that, when absent, directly or indirectly 
confers sensitivity to particular compounds. 

3, A Vector Donor DNA according to claim 2, wherein said 
Selectable- marker is at least one selected from the group consisting of an 

5 antibiotic resistance gene, a tRNA gene, an auxotrophic marker, a toxic gene, a 

phenot/pic marker, an antisense oligonucleotide; a restriction endonuclease; a 
restriction endonuclease cleavage site, an en2yme cleavage site, a protein binding 
^te; and a sequence complementary PGR primer. 

4. AVeGtorDonorDNAmoleculeaccordingtoclaim 1, \rtiereinsaid 
10 Selectab!& marker comprises at least one inactive fiagment of a Selectable 

marker, wherein the inactive fiagment is c£^able of reconstituting a functional 
Selectable marker when recombined across said first or second recombination site 
with a further DNA segm^t comprising another inactive fragment of the 
Selectable maiker. 

15 5. An Insert Donor DNA molecule, comprising a desired DNA 

segment flanked by a first recombination site and a second recombination site, 
wherein the first and second recombination sites are ragineered and do not 
recombine with each other. 

6, An Insert Donor DNA molecule according to claim 5, wherein 
20 said desired DNA segment codes for at least one selected from the group 

consisting of a cloning site, a restriction site, a promoter, an operon, an origin of 
replication, a fimctional DNA, an antisense RNA, a PGR fragment,-a protein or 
a protein fragment. 

7. A kit comprising a container being compartmentalized to receive 
25 in close confinement therein at one compartment, v^erein a first compartment 

contains a Vector Donor DNA molecule comprising a first DNA segment and a 
second DNA segment, said first or second DNA segment contmning at least one 



WO9d/40724 PCT/US96/10082 

-54- 

Selectable marker, wherein the first and second segments are flanked either by, 
(i) in a circular Vector Donor, a first and a second recombination site, or (ii) in 
a linear Vector Donor, a first recombination site, \v^erein each pair of flanking 
recombination sites are engineered and do not recombine with each other. 

A kit according to cldm 7, fiirtfaer comprising a second 
compartment containing an Insert Donor DNA molecule comprising a desired 
DNA s^eat flanked by a first xecombmation site and a second recombination 
^te, ^riieiein the first and second recombination sites axe engineeied and do not 
recombine with each other. 

9. A kit acooidmg to clmm 7, fiirdier compri^ng an additional 
compartment containmg at least one recombination protdn c^ble of 
recombining a DNA segment comprising at least one of said recombination ^tes. 

10. A nucleic acid molecule, comprising at least one DNA segment 
having at least two recombination sites flankmg a Selectable marker or a desired 
DNA segment, wherein at least one of said recombination sites comprises a core 
region having at least one engineered mutation that enhances recombmation in 
vitro in tixe formation of a Cointegrate DNA or a Product DNA, 

11. A nucleic acid molecule according to claim 10, vsiierein said 
mutation confers at least one enhancement of said recombmation, said 
enhancement selected fix)m tiie group consisting of substantially (i) favoring 
excisive recombination; (n) fevoring integrative recombination; Ciii) relieving the 
requhement for host fiictors; (iv) increasmg tiie efficiency of said Cointegrate 
DNAorProduct DNA fomiation;(v) increasing flie specificity of said Cointegrate 
DNA or Product DNA formatioi^ and contributes desirable attributes to tiie 
Product DNA. 
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12. A nucleic add molecule according to claim 10, vAierdn said 
lecombinadon dte is derived bom at least one xecombinatioii site selected fom 
the grotq> consisting ofattB, attP, attL and attR. 

13. A nucleic acid molecule according to claim 12, wherein said att 
site is selected from the groups consisting of attl, att2 and att3. 

14. A nucleic acid according to claim 10, wherein said core region 
comprises a DNA sequence selected from the group consisting of: 

(a) RKYCWCKnTITKmTACNAASTSGB(m-a^^ 

(b) AGCOyGCnTTYKTRTACNAACTSGB (m-attB) (SEQ ID 

N0:2); 

(c) GTTCAGCTTTCKTRTACNAACTSGB (m-attR) (SEQ ID 

N0:3); 

(d) AGCCWGCTTTCKTRTACNAAGTSGB (m-atiL) (SEQ ID 

N0:4); 

(e) GTTCAGCTrrYKTRTACNAAGTSGB(m-attPl) (SEQ ID 

N0:5); 

and a correqwnding or cpmplemcntaiy DNA or RNA sequence, wherein R=A or 
G; K=G or T/U; Y=C or T/U; W=A or TAJ; N^A or C or G or T/U; S<:or G; 
andB=CorGorT/U. 



15. A nucleic acid according to claim 14, wherein said core region 
comprises a DNA sequence selected from the group consisting of: 

(a) AGCCTGCTTTTrrGTACAAACITGT(attBl)(SEQroN0:6); 

(b) AGCCTGCrrTCTTGTACAAACTTGT(attB2)(SEQIDNO:7); 

(c) ACCCAGCnTCTrGTACAAACITGT (attB3) (SEQ ID N0:8); 

(d) GTTCAGCTITriTGTACAAACrTGT^^ 

(e) GTrCAGClTrCnTGTACAAACrrGT(fflttR2)(SEQIDNai^^^^ 

(f) GTTCAGCTTTCTTGTACAAAGTTGG (attR3) (SEQ ID 
N0:11); 
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(g) AGCCrGCTTTTTTGTACAAAGTTGG (attLl) (SEQ ID 
N0:12); 

(h) AGCCTGCTTTCTTGTACAAAGTTGO (altL2) (SEQ ID 
N0:13); 

(i) ACCCAGCmCTTGTACAAAGTTGG (attL3) (SEQ ID 
N0:14); 

0) GTTCAGCTriTTTGTACAAAGTTGG(attPl) (SEQ ID N0:15); 
(k) GTTCAGCTTTCTTGTACAAAGTTGG (attP2,P3) (SEQ ID 
N0:16); 

and a conesponding or complanraitaty DNA or RNA sequence. 

16. A mefliod for making a nucleic acid molecule, comprising 
providing a nuddc acid molecule having at least one engineered 

recombination site comprising at least one DNA sequence haying at least 90% 

homology to at least one of SEQ ID N0S:1-16. 



15 17. A nucleic acid molecule provided by a method according to 

claim 16. 

1 8. A composition, comprising a nucleic acid molecule according to 
cimm 10. 



10 



20 



19, A kit, comprising a container being compartmentalized to receive 
in close confinement therem at least one compartment, v^crein a first 
compartment contains a composition according to claim 18*. 



20. A kit accordii^ to claim 19, fiirtha: comprising a second 
compartment having at least one recombmation protein that reco^iizes said 
recombination site. 
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21 . A kit comprising a container being compartmentalized to receive ' 
in close confinement therein at least one recombination protein in isolated form, 
useful for a method according to claim 22. 

22. A method of making a Cointegrate DNA molecule, comprising 
combining in vitro: 

(i) an Insert Donor DNA molecule, comprising a desired 
DNA segnient flanked by a first recombination site and a 
second recombination site, wherein the first and second 
recombination sites do not recombine vdth each otfa^ 
&) a Vector Donor DNA molecule containmg a tiurd 
recombination site and a fourth recombmation site, 
wherein die third and fourth recombination sites do not 
recombine witii eadi other, and 
Qn) at least one site spedfic recombination protein capable of 
recombmmg said first and third recornbinational dtes said 
second and fourth recombinational sites; 
thereby allowing recombination to occur, so as to produce a Cointegrate DNA 
molecule con^rising said first and third or said second and fourtii recombmation 
sites. 

23. A method according to claim 22, wh^ein a Product DNA 
molecule is produced fcom said Cointegrate DNA by recombining at least one of 
0) said first and third, or (ii) said second and fourth, recombination sites, said 
Product DNA comprising said desired DNA segment. 

. 24. A method acconfing to claim 23, wherem said method also 
produces a Byproduct DNA molecule. 



25. A method according to claim 23, further comprising selecting for 
die Product DNA molecule. 
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26. A method as claimed in claim 22, \dieiein the Vector Donor DNA 
molecule comprises a vector segment flanked by said third and die fourdi 
recombination sites. 

27. A method as claimed in claim 22, \^1ierem the Vector Donor DNA 
molecule further comprises (a) a toxic gene and (b) a Selectable maricer, wdierein 
the toxic gene and the Selectable maricer are on different DNA segments, the 
DNA segments bemg separated either by 0) in a circular DNA molecule, two 
recombination sites, or (ii) in a linear DNA molecule, one recombination site. 

28 . A method as claimed in claim 22, vd^rdn the Vector Donor DNA 
mdeciile fiirtha: comprises (a) a represaon cassette and (b) a Selectable maricw, 
rqwessed by die iq)ressor of the repression cassette, and vrhimin the Selectable 
markCT and the iqiression cassette are on diflfacnt DNA stents, the DNA 
stents being separated cither by. 0) in a circular DNA molecule, two 
recombmation sites, or (ii) in a linear DNA molecule, one recombmation site. 

29. A method as claimed in claim 22, wherem at least one of die Insert 
Donor DNA molecule and the Vector Donor DNA molecule is a circular DNA 
molecule. 



30. A method as claimed in claim 22, wherein at least one of the Insert 
Donor DNA molecule and the Vector Donor DNA molecule is a linear DNA 

. molecule. 

31. A method as claimed in claim 22, wherein the selecting step is 
carried out in vitro ot in vivo, 

32. A method as claimed in claim 22, \Aercin said recombination 
protein comprises at least a first recombination protein and a second 
recombination protein, said second recombmation protein being dififerent from 
said fust recombination protein. 



W6W40724 PCT/US96/10082 

-59- 

33. A melfaod as claimed in claim 22, wherein said recombination 
protdnisint 



34. A method as clauned in claim 22, \siierein the at least one 
recombination protein is selected from ® Int and IHF and (Li) Int, Xis, and IHF. 
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